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(57) Abstract 

In accordance with the present invention, it has been 
discovered that nuclear receptor proteins isolated from the 
silk moth bombyx mori (bR) are useful for the regulation 
of transgene expression. bR is generally thought to be 
a strong transcriptional regulator within cells of the silk 
moth. In accordance with the present invention, it has 
been discovered that bR is also functional in mammalian 
cells. It has further been discovered that the addition 
of activation domains to the bR open-reading frame 
enhances the activity of the ligand modulated regulator 
to afford high-level transcriptional induction. Further 
modifications to the bR ligand binding domain result in 
receptors with unique tnwactivational characteristics. 
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Modified Lepidopteran Receptors and Hybrid 
Multifunctional Proteins for Use in Regulation of 
Transaene Expression 
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RELATED APPLICATIONS 

This application claims priority from Provisional 

Application, U. S.S.N, 60/ , filed July 7 , 1998, and 

also claims priority from United States Serial No. 
10 08/891,298, filed July 10, 1997, now pending, the entire 
contents of both of which are hereby incorporated by 
reference herein. 

FIELD OF THE INVENTION 

The present invention relates to methods in the field 
15 of recombinant DNA technology, and products related 
thereto. In a particular aspect, the invention relates to 
methods for modulating the expression of exogenous genes in 
mammalian systems, and products useful therefor. 

BACKGROUND OF THE INVENTION 

20 In the field of genetic engineering, precise control 

*6f gene '^expression ""is " an "*~ ThvaluaSle tooT^'lror"" st:udyTng^ 
manipulating and controlling development and other 
physiological processes. For example applications for 
regulated gene expression in mammalian systems include 

25 inducible gene targeting, overexpression of toxic and 
teratogenic genes, anti-sense RNA expression, and gene 
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therapy (see, for example, Jaenisch, R. (1988) Science 240, 
1468-1474) . For cultured cells, glucocorticoids and other 
steroids have been used to induce the expression of a 
desired gene. 

5 As another means for controlling gene expression in 

mammalian systems, an inducible tetracycline regulated 
system has been devised and utilized in transgenic mice, 
whereby gene activity is induced in the absence of the 
antibiotic and repressed in its presence (see, e.g, Gossen 

10 et al. (1992) PNAS 89, 5547-5551; Gossen et al.(1993) TIBS 
18, 471-475; Furth et al . (1994) PNAS 91, 9302-9306; and 
Shockett et al. (1995) PNAS 92, 6522-6526). However, 
disadvantages of the inducible tetracycline system include 
the requirement for continuous administration of 

15 tetracycline to repress expression and the slow clearance 
of antibiotic from bone, which interferes with regulation 
of gene expression. While this system has been improved by 
the recent identification of a mutant tetracycline 
repressor which acts conversely as an inducible activator, 

20 the pharmacokinetics of tetracycline may hinder its use 
during development when a precise and efficient "on-off" , 
switch is essential (see, e.g., Gossen et al . (1995) 
Science 268, 1766-1769) . 

Historically, the expression of transgenes with 
25 potential pathogenic and cytotoxic properties has been 
difficult in the context of stable cell lines since cells 
chronically producing significant levels of the toxic 
*~ '-^^^^TransgeRe^'g^ie3?a'lly^ c dde *• -§rom ^ t he - ey fcopa fch ic ^e-M^Gfc&^ ^Por^ *~ 
example, CAG repeats and concomitant polyglut amine (polyQ) 
3 0 expression has been linked to a variety of 
neurodegenerative conditions, including Hungtington' s 
disease (Koide et al . (1994) Nat Genet 6:9-13), 
dentatorubro-pallidoluysian atrophy (Ranum et al . (1994) 
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Nat Genet 8:280-284), spinocerebellar ataxias (Kawaguch et 
al. (1994) Nat Genet 8:221-228, Imbert et al . (1993) Nat 
Genet 4:72-76, Sanpei et al . (1996) Nat Genet 14:277-284, 
and Zhuchenko et al . (1997) Nat Genet 15:62-69), and 
5 spinobulbar muscular atrophy (David et al . (1997) Nat Genet 
17:65-70). PolyQ expansion has also been linked to the 
formation of intracellular protein aggregates in vivo in 
transgenic mice expressing human huntingtin fragments with 
expanded repeats (Ordway et al (1997) Cell 91:753-763), 
10 brain tissue from Huntington's patients (DiFiglia et al. 
(1997) Science 277:1990-1993) and transiently transfected 
cultured cells (Martindale et al (1998) Nat Genet 18:150- 
154 and Cooper et al . (1998) Hum Mol Genet 7:783-790). 

In transient transfection models, a 79-Q expanded 
15 repeat from the ataxin 3 gene has been shown to form a 
"punctate pattern" within the cytoplasm, and to result in 
the apopotic cell death of 70-80% of transfected COS-7 
cells within 48 hours of transfection. Paulson et al . has 
reported that 293 cells transfected with a separate 78-Q 
20 ataxin-3 derived construct acquired intranuclear inclusion 
bodies and also underwent apoptotic cell death in a subset 
(30%) of the transfected population with 72 hours of 
transfection. Neuroblastoma cells transiently transfected 
with an 82-Q construct derived from an expanded huntingtin 

2 5 gene exon 1 were found to be 2-3 times more susceptible to 

staurosporine induced apoptosis than cells transfected with 
a LacZ control construct (18) , however, spontaneous cell 
death was not described. Variations in culture techniques, 

3 0 difficult to rigorously characterize the progression of 

polyQ expression, intracellular aggregate (IA) formation, 
and cell death. One impediment to the comparative study of 
IAs in vitro is that transient transfection studies predict 
that toxicity from polyQ over- express ion will prevent the 
35 isolation of stable cultured cells. 
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Accordingly, there is a need in the art for improved 
systems to precisely modulate the expression of exogenous 
genes in mammalian subjects. For example, a non- 
mammalian -based transcription regulating system would be 
5 desirable for general application to transgene regulation 
in in vitro, ex vivo, and in vivo applications, as well as 
transgenic animals. Such systems would be extremely 
desirable to conditionally express pathogenic or toxic 
compounds in order to produce stable cells. A system that 
10 is simple, compact and dependent on ligands which are 
relatively inexpensive, readily available and of low 
toxicity in animals would prove useful for stimulation of 
regulated systems . 

BRIEF DESCRIPTION OF THE INVENTION 

15 In accordance with the present invention, it has been 

discovered that nuclear receptor proteins isolated from the 
silk moth Jbombyx mori (bR) are useful for the regulation of 
transgene expression, bR is generally thought to be a 
strong transcriptional regulator within cells of the silk 

20 moth. In accordance with the present invention, it has 
been discovered that bR is also functional in mammalian 
cells without the addition of exogenous dimer partners 
therefor. It has further been discovered that the addition 
of activation domains to the bR open-reading frame (VbR) 

25 enhances the activity of the ligand modulated regulator to 
afford high-level transcriptional induction (see, e.g., 
Figure 1A) . Further modifications to the bR ligand binding 

characteristics (see, e.g., Figure IB). 

3 0 In accordance with another aspect of the present 

invention, hybrid proteins produced by fusion of modified 
bRs with other ligand-regulated proteins have been found to 
be capable of high level, regulated transactivation of 
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sequences controlled by both hormone response elements and 
tetracycline operators. VbR variants and hybrid proteins 
(see, e.g., Figure 1C) , in combination with the appropriate 
promoters and transgenes, can be introduced into target 
5 cells by common methods such as transfection of plasmids or 
by virus mediated gene transfer. The small size and 
simplicity of these proteins makes them particularly 
attractive for use in retroviral vectors. 

Invention receptor peptides have proven capable of 
10 providing a high degree (>100 fold) of ligand-dependent 
transgene expression in a variety of infected and 
transfected cell types including primary cell types. In 
addition, several transgenes including lucif erase, LacZ, 
human GH, GFP, tyrosine hydroxylase, and the low-affinity 
15 NGF receptor, have all been shown to function in a 
ligand-dependent fashion in cells infected with invention 
constructs . 

Invention methods provide for regulated gene 
expression by exogenous non-mammalian inducers, and 

2 0 therefor can be advantageously employed in a variety of in 
vivo and in vitro mammalian expression systems. For 
example, inducible expression of flp recombinase in 
transgenic mammals, in accordance with invention methods, 
would enable those of skill in the art to accomplish 

25 temporally specific inducible gene targeting of the adult 
or the developing embryo (See, e.g., U.S. Patent Nos . 
5,654,182 and 5,677,177). Stable cell lines conditionally 

study and treatment of diseases. In order to maximize the 
30 chance of obtaining stably transfected/ infected cultured 
cells expressing a desired transgene, such as polyglut amine 
expression cassette (PQEC) , a retroviral construct is 
provided in which expression of the transgene is controlled 
by an exogenous ligand. 
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For example, in order to study possible mechanisms 
underlying Huntington's disease, cultured cell types that 
conditionally express the glutamine (CAG) repeats 
characteristic of the pathological form of the huntingtin 
5 protein are produced. Two aspects of polyQ-mediated 
cytotoxicity are well-suited to characterization in vitro; 
the expression, distribution, and aggregation of proteins 
with expanded polyQ tracts within individual cells, and the 
progression of polyQ expression to death of the cell . The 

10 huntingtin protein is expressed predominantly in the 
cytoplasm of both neuronal and non-neuronal cell types. 
The expression of polyglut amine tracts separately from the 
huntingtin protein will allow one to determine what toxic 
properties, if any, polyglutamine tracts themselves may 

15 harbor. A variable length polyglutamine fused to cytosolic 
reporter proteins such as green fluorescent protein (GFP) 
facilitates the selection and characterization of PQEC 
ligand-responsive cell populations and clonal lines. The 
use of recombinant retroviruses as PQEC transfer vectors 

20 will allow the stable integration of the transgene into a 
wide variety of cultured cell types. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1A provides a schematic drawing of the 
nuclear receptor protein isolated from the silk moth 
25 Jbombyx mori (bR) , as well as a construct containing a 
VP16 activation domain fused at an internal Mlul site 
near the N- terminus of the Jbombyx mori nuclear receptor, 



Figure IB depicts the replacement of several amino 
30 acids of the ligand binding domain of the bombyx mori 
nuclear receptor with sequences from the retinoic acid 
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receptor or the thyroid hormone receptor (VbRRA and 
VbRTR, respectively) . 

Figure 1C provides a schematic of a hybrid 
transactivating construct, designated TTMT, containing a 
5 tetracycline -responsive transact ivator (Tet- responsive 
region) , a VP16 activation domain and a peptide derived 
from bombyx mori . 

Figure 2A shows the induction of fusion proteins 
{VP16 activation domain fused to either Drosophila 
10 ecdysone receptor (dVEcR) or bombyx mori nuclear receptor 
derived (VbR) ) with 5 ^g/ml tebufenozide in the absence of 
exogenous dimer partners . 

Figure 2B shows that a peptide derived from jbombyx 
mori responds to tebufenozide (teb) with greater than 
15 200-fold induction, while Drosophila EcR responds weakly, 
it at all. 

Figure 2C illustrates the action of tebufenozide 
(teb) on the expression of EcRE-Luc reporter when co- 
transfected with the following receptors: 
20 LINX: which encodes the tet -transactivator (TTA) , 

VbR: which encodes VbR, 

TTMT: which encodes the TTA- VbR fusion protein, and 
TTMT-2V: similar to TTMT but with two VP16 t-domains. 

Figure 2D depicts the same experiment illustrated in 
-2£-.-vMguE£^G^ ^ 

only to TTA or fusion proteins in the absence of 
doxycycline. 
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Figure 2E shows the results of an experiment 
employing CV-1 cells transfected with TTMT and two 
reporters: 6TetO-luc and 4 -EcRE-p-galactosidase . 

Figure 3 shows that lower levels of teb are as 
5 effective as higher levels for stimulating full activity. 

Figure 4 is a schematic drawing of representative 
single-plasmid constructs involving VbR use. 

10 Figure 5 illustrates the construction of the pBW 

plasmid . 

Figure 6A provides a comparison of Bombyx mori 
ecdysone receptor (BE) and Drosophila melanogaster ecdysone 
receptor (DE) by transient transfection with both receptors 
15 and the dimer partners Usp or RXR in 293 cells. 

Figure 6B provides a similar comparison as in Figure 
6A, however, in CV-1 cells. 

Figure 7A is a schematic drawing of native and 
chimeric ecdysone receptors. 

20 Figure 7B is a graph depicting the transient 

transfection of VEH into CV-1 cells. 

Figure 8 is a schematic drawing comparing several 
different chimeric receptors. 

Figure 9A provides a schematic of receptor variant 
25 viral vectors and an ecdysone response element responsive 
reporter virus (MS) . 
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Figure 9B illustrates induction levels of luciferase 
in 293, CV-1 and primary fibroblast cells infected with MS- 
luc virus . 

Figure 10 illustrates the construction of the Boris I 
5 plasmid. 

Figure 11 illustrates the construction of the Boris 
III plasmid. 

Figure 12 illustrates the construction of the Boris IV 
plasmid , 

10 Figure 13 illustrates the construction of the pBW-6E- 

GFP-CB-VBE plasmid. 

Figure 14 illustrates the retroviral construct, BORIS 
III transcribes a transgene in the presence and absence of 
a ligand. 

15 Figures ISA and B illustrate the construction of a 

PolyQ expression construct. 

Figure 16 provides a comparison of various polyQ-GFP 
vectors constructed. 



DETAILED DESCRIPTION OF THE INVENTION 

20 In accordance with the present invention, there are 

provided nucleic acid constructs encoding a receptor 
peptide comprising: 

(i) a ligand binding domain and hinge region of 

a non- mammalian member of the nuclear 
25 receptor superfamily, 
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(ii) a DNA binding domain, and 

(iii) an activation domain, 

wherein the receptor peptide activates a regulatory element 
in the absence of an exogenous dimer partner therefor and 
5 in the presence of a ligand for said ligand binding domain. 
Alternatively, there are provided nucleic acid constructs 
encoding a receptor peptide comprising: 

(i) a ligand binding domain and hinge region of a 
non-mammalian member of the nuclear receptor 

10 superfamily, and 

(ii) a DNA binding domain, wherein the DNA 
binding domain is not obtained from the same 
member as the ligand binding domain and 
hinge region, 

15 wherein the receptor peptide activates a regulatory 
element in the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for said ligand 
binding domain. 

In accordance with another embodiment of the present 
20 invention, there are provided method (s) employing such 
constructs. For example, there are provided method (s) for 
modulating the transcription of exogenous nucleic acid(s) 
in a host containing: 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the 
control of a regulatory element; and 

L.iiX^ 

domain, and the ligand binding domain and hinge 
region of a non-mammalian member of the nuclear 
receptor superfamily which is not normally 
present in the cells of said host, wherein said 
receptor peptide activates said regulatory 
element in the absence of an exogenous dimer 



25 



30 
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partner therefor and in the presence of a ligand 
for said ligand binding domain, 



said method comprising administering to said host an 
amount of ligand effective to modulate the transcription of 
5 said exogenous nucleic acid(s) ; wherein said ligand is not 
normally present in the cells of said host. 

As employed herein, the terms "modulate" and 
"modulating" refer to the ability of a given 
ligand/receptor peptide complex to activate/deactivate 

10 and/or up-regulate/down-regulate transcription of exogenous 
nucleic acids , relative to the transact ivation activity of 
said receptor peptide in the absence of ligand. The actual 
effect of complex formation on the transactivation activity 
of a receptor peptide will vary depending on the specific 

15 ligand and DNA binding domains employed in the receptor 
peptide and on the regulatory element with which the 
ligand/receptor peptide complex interacts. 

As employed herein, the term "host" refers to the 
cell, tissue, organ or organism in need of transcriptional 

20 regulation of exogenous or endogenous nucleic acids. 
Preferably, hosts are mammalian or mammalian derived cells 
or tissue. Exemplary mammals include: humans; domesticated 
animals, e.g., rat, mouse, rabbit, canine, and feline; farm 
animals, e.g., chicken, bovine, porcine and ovine; and 

25 animals of zoological interest, e.g., monkey and baboon; 
and the like. 

As employed herein, a "dimer partner" refers to 
members of the nuclear receptor superfamily to which other 
members preferentially bind to form heterodimeric species. 
3 0 For example, members of the nuclear receptor superfamily 
preferentially form heterodimers with a common partner, the 
retinoid X (or 9-cis retinoic acid) receptor (RXR, see, for 
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example, Yu et al . (1991) Cell 67:1251-1266; Bugge et al . 
(1992) EMBO J. 11:1409-1418; Kliewer et al . (1992) Nature 
355:446-449; Leid et al , (1992) Cell 68:377-395; Marks et 
al. (1992) EMBO J. 11:1419-1435; Zhang • et al . (1992) Nature 
5 355:441-446; Issemann et al . (1993) Biochimie. 75:251-256). 
Additional dimer partners for members of the nuclear 
receptor superfamily include ultraspiracle (Usp) , f arnesoid 
X receptor (FXR) , and the like. 

Receptor peptides utilized in the present invention 

10 are characterized as being fully functional in mammalian 
cells without the addition of any exogenous dimer partners 
therefor. For such receptor peptides, the presence of 
native level or concentration of endogenous dimer partner 
is sufficient to promote transcription. As employed herein, 

15 the phrase "exogenous dimer partner" refers to a dimer 
partner for the receptor peptide that is introduced into 
the host . Exogenous dimer partner may be employed because 
the dimer partner is not present in the host cell (i.e., 
Usp) or because the dimer partner is required at elevated 

20 levels in the host (i.e., higher levels of RXR) . 
Oppositely, the phrase "endogenous dimer partner (s)" refers 
to dimer partners which are native to the unmodified 
mammalian host cell. Invention receptors are advantageous 
because dimer partner is not necessary at elevated levels 

25 in order for formation of heterodimers to occur, thereby 
promoting transcription. 

Members of the nuclear receptor superfamily are 
^chararcter i ; zed ^by^tiie ; presence 
E and F (Evans, R. Science 240:889-895 (1988)), wherein W C" 
3 0 corresponds to the DNA binding domain, W D" corresponds to 
the hinge region, and "E" corresponds to the ligand binding 
domain. In accordance with a particular aspect of the 
invention, it has been discovered that the hinge region 
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(D) , in coordination with the ligand binding domain (E) , of 
certain non- mammalian members of the nuclear receptor 
superfamily enable such members to interact with dimer 
partners endogenous to mammalian cells, i.e., in the 
5 absence of and without requiring introduction of exogenous 
dimer partners. Specifically, it has been identified that 
regions within the hinge region ( W D") and the ligand 
binding domain ( tt E") of the Jbombyx mori receptor function 
in concert to produce high affinity, ligand- dependent 
10 heterodimerization between bR and mammalian dimer partners, 
without requiring the introduction of exogenous dimer 
partners such as Usp. 

Thus, in accordance with a preferred embodiment of the 
present invention, the invention receptor peptide comprises 

15 the hinge region and ligand binding domain derived from 
insect receptors of lepidopteran species such as bombyx 
mori , (Swevers et al . Insect Biochem. Molec. Biol. 
25 (7) :857-866 (1995)), Choristoneura fumiferana (Palli et 
al. Insect Biochem. Molec . Biol. 26 (5) :485-499 (1996)), 

20 Manduca sexta (Fujiwara et al . Insect Biochem. Molec. Biol. 
25(7) :845-856 (1995) ) , Aedes aegypti (Cho et al . Insect 
Biochem Molec. Biol. 25:19-27 (1995), Chorinomus tentans 
(Imhof et al. Insect Biochem. Molec. Biol. 25:115-124 
(199 3) , and the like. Presently preferred insect receptors 

2 5 from which the hinge region and/or ligand binding domain is 

derived substantially lack the C- terminal "F" domain. In 
addition, presently preferred insect receptors share less 
than 50% sequence similarity with Drosophila ecdysone 
receptors , " comparisons perf ormecf * %y * Higgihs ~ "and ' sffarp 

3 0 sequence alignment. 

As employed herein, the term "hinge region" or W D 
domain" refers to the domain located between the DNA 
binding domain and the ligand binding domain of intact 
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members of the nuclear receptor superfamily. In accordance 
with a preferred embodiment of the present invention, the 
hinge region facilitates high affinity interaction with 
dimer partners endogenous to mammalian cells, also referred 
5 herein as "endogenous dimer partners." Preferably, the 
hinge region of a non-mammalian member of the nuclear 
receptor superfamily can be characterized as any sequence 
having substantial sequence identity with amino acid 
residues 273-362 set forth in SEQ ID NO: 3, or substantial 
10 portions thereof which confer sufficiently high affinity 
for endogenous dimer partner. 

The hinge region can be functionally located in either 
orientation and at various positions within the receptor 
peptide. For example, the hinge region can be positioned 
15 at either the amino or carboxy terminus of the receptor 
peptide, or therebetween. In a preferred embodiment of the 
present invention, the hinge region is positioned 
internally between the ligand binding and DNA binding 
domains of the receptor peptide (see Figure 1A) . 

20 As employed herein, the phrase "ligand binding domain 

of a non-mammalian member of the nuclear receptor 
superfamily" refers to ligand binding domains derived from 
receptors which are not endogenous to a mammalian host. 
Ligand binding domains which are not endogenous to a 

25 mammalian host include ligand binding domains which are 
modified to be non- responsive to ligands endogenous or 
native to the host. Ligand binding domains contemplated 

from non-mammalian member (s) of the nuclear receptor 
3 0 superfamily which members are not normally present in the 
cells of a host. Ligand binding domains are preferably 
derived from the carboxy- terminal portion of non-mammalian 
receptors which are capable of activating transcription of 
regulatory elements in the absence of exogenous dimer 
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partner therefore. Exemplary receptors which are not 
normally present in mammalian cells include insect 
receptors, plant receptors, and the like. 

Ligand binding domains can be functionally located in 
5 either orientation and at various positions within the 
receptor peptide. For example, the ligand binding domain 
can be positioned at either the amino or carboxy terminus 
of the receptor peptide, or therebetween. In a preferred 
embodiment of the present invention, the ligand binding 
10 domain is positioned at the carboxy terminus of the 
receptor peptide (see Figure 1A) . 

Exemplary ligand binding domains can alternatively be 
characterized as comprising substantial sequence identity 
with amino acid residues 393-508 ( W E 2 "), preferably amino 

15 acid residues 393-586 (E 2 and E 3l wherein "E 3 " is amino acid 
residues 509-586), as set forth in SEQ ID NO:3, or 
substantial portions thereof (i.e., typically at least 46 
or more contiguous nucleotides thereof) . It has been 
determined that the specific ligand response determinant of 

20 bombyx lies within the E 2 region of the native bR, whereas 
both E 2 and E 3 regions facilitate heterodimer formation with 
endogenous dimer partners. Modifications of this sequence 
contemplated for use in the practice of the present 
invention include replacing several amino acids of the 

25 ligand binding domain with sequences from ligand binding 
domains of other members of the nuclear receptor 
superfamily, such as retinoic acid receptor and/or thyroid 
^^.r«Jp&WQE& pt or, J.Eign re . 1|J. . v ^.Xhe^ 

unique transactivating characteristics and/or eliminate 

3 0 restriction sites, which facilitate the construction of 
useful peptide-based viral constructs. 

DNA-binding domains contemplated for use in the 
preparation of invention receptor peptides are typically 
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obtained from DNA-binding proteins (e.g., transcription 
factors) * The term "DNA-binding domain" is understood in 
the art to refer to an amino acid sequence that is able to 
bind to DNA. As used herein, the term "DNA-binding domain" 
5 encompasses a minimal peptide sequence of a DNA-binding 
protein up to the entire length of a DNA-binding protein, 
so long as the DNA-binding domain functions to associate 
with a particular regulatory element. 

DNA-binding domains are known to function 
10 heterologously in combination with other functional domains 
by maintaining the ability to bind DNA recognition sequence 
(see, e.g., Brent and Ptashne, (1985) Cell, 43:729-736). 
For example, with respect to hormone receptors, DNA-binding 
domains are interchangeable, thereby providing numerous 
15 chimeric receptor proteins (see, e.g., U.S. Patent 
4,981,784; and Evans, R. , (1988) Science, 240:889-895). 
Similar to the ligand binding domain, the DNA-binding 
domain can be positioned at either the carboxy terminus or 
the amino terminus, or the DNA-binding domain can be 
2 0 positioned between the ligand binding domain and the 
activation domain. In preferred embodiments of the present 
invention, the DNA-binding domain is positioned internally 
between the ligand binding domain and the activation 
domain. 

25 "DNA-binding protein(s)" contemplated for use herein 

belong to the well-known class of proteins that are able to 
directly bind DNA and facilitate initiation or repression 

contemplated for use herein include transcription control 
30 proteins (e.g., transcription factors and the like; Conaway 
and Conaway, 1994, "Transcription Mechanisms and 
Regulation", Raven Press Series on Molecular and Cellular 
Biology, Vol. 3, Raven Press, Ltd., New York, NY). 
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Transcription factors contemplated for use herein as 
a source of such DNA binding domains include, e.g., 
homeobox proteins, zinc finger proteins, hormone receptors, 
helix-turn-helix proteins, helix-loop-helix proteins, 
5 basic -Zip proteins (bZip) , 3 -ribbon factors, and the like. 
See, for example, Harrison, S., "A Structural Taxonomy of 
DNA-binding Domains," Mature, 353:715-719. Homeobox DNA- 
binding proteins suitable for use herein include, for 
example, HOX, STF-1 (Leonard et al . , 1993, Mol . Endo., 

10 7:1275-1283), Antp, Mat a-2, INV, and the like. See, also, 
Scott et al. (1989), Biochem. Biophys . Acta, 989:25-48. It 
has been found that a fragment of 76 amino acids 
(corresponding to amino acids 140-215 described in Leonard 
et al., (1993) Mol. Endo. , 7:1275-1283) containing the STF- 

15 1 homeodomain binds DNA as tightly as wild-type STF-1 . 
Suitable zinc finger DNA-binding proteins for use herein 
include Zif268, GLI , XFin, and the like. See also, Klug 
and Rhodes (1987) Trends Biochem. Sci., 12:464; Jacobs and 
Michaels (1990) New Biol . , 2:583; and Jacobs (1992), EMBO 

20 J. , 11:4507-4517. 

An additional DNA binding domain contemplated for use 
in the practice of the present invention is the GAL4 DNA 
binding domain. The DNA binding domain of the yeast GAL4 
protein comprises at least the first 74 amino terminal 
25 amino acids thereof (see, for example, Keegan et al . , 
Science 231:699-704 (1986)). Preferably, the first 90 or 
more amino terminal amino acids of the GAL4 protein will be 
used, with the 147 amino terminal amino acid residues of 

30 Preferably, DNA-binding domain (s) used herein is (are) 

obtained from a member of the nuclear receptor superfamily. 
As used herein, the phrase "member (s) of the nuclear 
receptor superfamily" (also known as "intracellular 
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receptors" or "steroid/thyroid hormone superfamily of 
receptors") refers to hormone binding proteins that operate 
as ligand- dependent transcription factors, including 
identified members of the nuclear receptor superfamily for 
5 which specific ligands have not yet been identified 
(referred to hereinafter as "orphan receptors") . 

Exemplary members of the steroid/thyroid hormone 
superfamily of receptors (including the various isoforms 
thereof) include steroid receptors such as glucocorticoid 

10 receptor (GR) , mineralocorticoid receptor (MR) , estrogen 
receptor (ER) , progesterone receptor (PR) , androgen 
receptor (AR) , vitamin D 3 receptor (VDR) , various isoforms 
of peroxisome prolif erator-activated receptors (PPARs) , and 
the like; plus retinoid receptors, such as the various 

15 isoforms of retinoic acid receptor (e.g., RARa, RARg, or 
RARy) i the various isoforms of retinoid X receptor (e.g., 
RXRa, RXR3/ or RXRy) t and the like (see, e.g., U.S. Patent 
Nos. 4,981,784; 5,171,671; and 5,071,773); thyroid 
receptors (TR) , such as TRa, TR3, and the like; insect 

20 derived receptors such as the ecdysone receptor, and the 
like; as well as other gene products which, by their 
structure and properties, are considered to be members of 
the superfamily, as defined hereinabove, including the 
various isoforms thereof. Examples of orphan receptors 

25 contemplated for use herein as a source of DNA binding 
domain include HNF4 {see, for example, Sladek et al . , in 
Genes & Development 4: 2353-2365 (1990)), the COUP family 
of receptors (see, for example, Miyajima et al . , in Nucleic 

30 Nature 340: 163-166 (1989)), COUP-like receptors and COUP 
homologs, such as those described by Mlodzik et al . , in 
Cell 60: 211-224 (1990) and Ladias et al . , in Science 251: 
561-565 (1991), the insect derived knirps and knirps- 
related receptors, and the like. 
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The DNA-binding domains of all members of the nuclear 
receptor superfamily are related. Such domains consist of 
66-68 amino acid residues, and possess about 20 invariant 
amino acid residues, including nine cysteines. Members of 
5 the superfamily are characterized as proteins which contain 
these 20 invariant amino acid residues. The highly 
conserved amino acids of the DNA-binding domain of members 
of the superfamily are as follows: 

Cys - X - X - Cys - X - X - Asp* - X - Ala* - X - 
10 Gly* - X - Tyr* -X-X-X-X- Cys - X - X - 

Cys - Lys* - X - Phe - Phe - X - Arg* - X - X - 
X-X-X-X-X-X-X- (X-X-) Cys - X -X 

- X - X - X - (X - X - X -) Cys - X -X - X - Lys 

- X - X - Arg - X - X - Cys - X - X - Cys - Arg* 
15 - X - X - Lys* - Cys - X - X - X - Gly* - Met 

(SEQ ID NO:l) ; 

wherein X designates non-conserved amino acids within the 
DNA-binding domain; an asterisk denotes the amino acid 
residues which are almost universally conserved, but for 
20 which variations have been found in some identified hormone 
receptors; and the residues enclosed in parenthesis are 
optional residues (thus, the DNA-binding domain is a 
minimum of 66 amino acids in length, but can contain 
several additional residues) . 

25 It has also been found that in vitro evolution methods 

can be applied to modify and improve existing DNA-binding 

. -^ w ^4Qm^a4nt:,.X^.a e ^.e^ g peyl Xj\ e t_a I ^ 1990 , Science , 24 9. ; 4 0,4- :tj 
406; and Scott and Smith, 1990, Science . 249:386-390). 
Alternatively, DNA-binding domains which are engineered 

30 with novel DNA- recognition specificity (see, e.g., 
Pomerantz et al . Science 267:93-96, 1995, 2FHD1, an 
engineered transcription factor with a composite DNA- 
binding domain) are also contemplated. 
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Two polynucleotides or polypeptides are said to be 
"identical" if the sequence of nucleotides or amino acid 
residues in the two sequences is the same when aligned for 
maximum correspondence . Optimal alignment of sequences for 
5 comparison may be conducted by the local homology algorithm 
of Smith and Waterman, Adv. Appl . Math., 2:482 (1981), by 
the homology alignment algorithm of Needleman and Wunsch, 
J". Mol. Biol., 48:443 (1970), by the search for similarity 
method of Pearson and Lipman, Proc. Natl. Acad. Sci. 
10 (U.S.A.) , 85:2444 (1988), by computerized implementations 
of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the 
Wisconsin Genetics Software Package, Genetics Computer 
Group, 575 Science Dr., Madison, Wis.), or by inspection. 
These references are incorporated herein by reference. 

15 The percentage of sequence identity between two 

sequences is determined by comparing two optimally aligned 
sequences over a window of comparison of at least 20 
positions. The percentage is calculated by determining the 
number of positions at which the identical nucleic acid 

20 base or amino acid residue occurs in both sequences to 
yield the number of matched positions, dividing the number 
of matched positions by the total number of positions in 
the window of comparison (i.e., the window size), and 
multiplying the result by 100 to yield the percentage of 

25 sequence identity. 

For instance, a preferred method for comparing 
sequences uses the GAP program based on the algorithm of 
^» ^--^eedl emany-^ 

all parameters are selected. These are gap weight: 5.0, 
3 0 length weight: 0.30, average match: 1.0, and average 
mismatch: 0.0. 



WO 99/02683 



POYUS98/I42I5 



21 

As used herein, the phrase "substantial sequence 
identity" refers to nucleotide or amino acid sequences 
which share at least 80% sequence identity, preferably 90%, 
more preferably 95 % or more, regardless of the algorithm 
5 used to determine sequence identity, compared to a 
reference sequence over a comparison window of about 20 bp 
to about 2000 bp, typically about 50 to about 1500 bp, 
usually about 350 bp to about 1200. The values of percent 
identity are preferably determined using the GAP program, 

10 referred to above. Another indication that nucleotide 
sequences are substantially identical is if two molecules 
hybridize to each other under stringent conditions . It is 
recognized, however, that proteins (and DNA or mRNA 
encoding such proteins) containing less than the above- 

15 described level of homology produced as splice variants or 
as a result of conservative amino acid substitutions (or 
substitution of degenerate codons) are contemplated to be 
within the scope of the present invention. 

The phrase "substantially the same," used in reference 
20 to a DNA nucleotide sequence, an RNA ribonucleotide 
sequence, or an amino acid sequence / refers to sequences 
that have slight and non- consequential sequence variations 
from the actual sequences disclosed herein. Species that 
are substantially the same are considered to be equivalent 
25 to the disclosed sequences. In this regard, "slight and 
non- consequential sequence variations" mean that sequences 
that are substantially the same as the DNA, RNA, or 
proteins disclosed and claimed herein are functionally 
^^^eqviiv^]^^ ilii^losed^ herein 

30 Functionally equivalent sequences will function in 
substantially the same manner to produce substantially the 
same effect as the sequences disclosed. 

Receptor peptides employed in the present invention 
can be modified by the introduction of activation domains. 
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Activation domains contemplated for use in the practice of 
the present invention are well known in the art and can 
readily be identified by those of skill in the art. 
Activation domains contemplated for use herein are 
5 typically derived from transcription factors and comprise 
a contiguous sequence that functions to activate gene 
expression when associated with a suitable DNA-binding 
domain and a suitable ligand binding domain. An activation 
domain can be positioned at any convenient site within the 
10 receptor peptide, i.e., at the carboxy terminus, the amino 
terminus or between the ligand binding domain and the DNA 
binding domain. In preferred embodiments of the present 
invention, the activation domain is positioned at the amino 
terminus of the receptor peptide. 

15 Suitable activation domains can be obtained from a 

variety of sources, e.g., from the N- terminal region of a 
member of the steroid/thyroid hormone superfamily of 
receptors, from a transcription factor activation domain, 
such as, for example, VP16, GAL4, NF-kB or BP64 activation 

20 domains, and the like. The presently most preferred 
activation domain contemplated for use in the practice of 
the present invention is obtained from the N-terminal 
region of the VP16 protein. 

In a particular embodiment of the present invention, 
25 the invention nucleic acid construct further comprises 
promoters and regulatory elements operatively associated 
with exogenous nucleic acids. In a preferred embodiment of 

t he present JLnyent i on receptor pe pt ide in ^ the _,gresencp ^of ^ 

a ligand therefor, binds the regulatory element and 
30 activates transcription of the exogenous nucleic acids. 

Regulatory elements contemplated for use in the 
practice of the present invention include elements 
responsive to the invention receptor peptide. In a 
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preferred embodiment of the present invention, such 
elements are exogenous regulatory elements not normally 
present in the cells of the host. One class of exogenous 
regulatory elements contemplated for use herein includes 
5 hormone response elements which modulate transcription of 
exogenous nucleic acid when bound to the DNA binding domain 
of an invention receptor peptide. 

Additional regulatory elements that may be utilized in 
the practice of the present invention include exogenous 

10 regulatory elements responsive to a non-mammalian 
transactivator . One such transact ivator- responsive 

regulatory element is an operator which confers 
responsiveness to antibiotics. Exemplary operators 
contemplated for use in this aspect of the invention 

15 include the tetracycline -analog regulated operator, the TET 
operator, the Lac operator, and the like. 

Exogenous response elements contemplated for use 
herein are short cis-acting sequences (i.e., having about 
12-20 bp) that are required for activation of transcription 

20 in response to association with a suitable ligand, such as 
diacyl hydrazines, and an invention receptor peptide. 
Response element sequences contemplated for use herein 
function in a position- and orientation- independent 
fashion. Exemplary response elements include hormone 

25 response elements, GAL4 response elements and the like. 

Hormone response elements contemplated for use in the 
,-,»_ 0 f: t-ho TTre«^rit invention are resoonse elements. , 
which are responsive to members of the nuclear receptor 
superf amily . These response elements comprise at least two 
30 half-sites (in either direct repeat or inverted repeat 
orientation to one another), separated by a spacer of 0-5 
nucleotides. As used herein, the term "half -site" refers 
to a contiguous 6 nucleotide sequence that is bound by a 
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particular member of the nuclear receptor super family. 
Typically, two half -sites with a corresponding spacer make 
up a hormone response element. Hormone response elements 
can be incorporated in multiple copies into various 
5 transcription regulatory regions.' 

Preferred hormone response elements employed in the 
practice of the present invention comprise a first half- 
site and a second half -site, separated by a spacer of 0-5 
nucleotides; 

wherein each half -site has the sequence: 
-RGBNNM-, 
(or complements thereof) wherein 

each R is independently selected from A or G; 
each B is independently selected from G, C, or T; 
each N is independently selected from A, T, C, or G; 
and 

each M is independently selected from A or C; 
with the proviso that at least 4 nucleotides of each 
-RGBNNM- group of nucleotides are identical with the 
nucleotides at comparable positions of the sequence 
-AGGTCA- . 

Exemplary half-sites having the -RGBNNM- motif for use 
in preparing response elements useful in the practice of 
the present invention include, for example, half-sites 
25 selected from -AGGGCA- , -AGTTCA- , -AGGTAA- , -AGGTCA- , 
-GGTTCA- , -GGGTTA- , -GGGTGA- , -AGGTGA- , -GGGTCA- , and the 

,. w _JLAH e -*— ~A__part.icu^ 

-AGTGCA- . 

Additional response elements included the GAL4 
30 response element. Exemplary GAL 4 response elements are 
those containing the palindromic 17-mer: 



15 
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5 1 - CGGAGGACTGTCCTCCG- 3 ' (SEQ ID NO : 4 ) , 

such as, for example, 17MX, as described by Webster et al., 
in Cell 52:169-178 (1988), as well as derivatives thereof. 
Additional examples of suitable response elements include 
5 those described by Hollenberg and Evans in Cell 55:899-906 
(1988); or Webster et al . in Cell 54:199-207 (1988). 

Regulatory elements employed in the practice of the 
present invention are operably linked to a suitable 
promoter for transcription of exogenous nucleic acid(s) 
10 product (s) . When exogenous nucleic acid(s), operatively 
linked to a suitable promoter, is (are) introduced into the 
cells of a suitable host, expression of the exogenous 
nucleic acid(s) is (are) controlled by the presence of 
ligands, which are not normally present in the host cells. 

15 As used herein, when referring to nucleic acids, the 

phrase "exogenous to said mammalian host" or simply 
"exogenous" refers to nucleic acids not naturally found at 
levels sufficient to provide a function in the particular 
cell where transcription is desired. For example, 

20 exogenous nucleic acids can be either natural or synthetic 
nucleic acids, which are introduced into the host in the 
form of DNA or RNA. The nucleic acids of interest can be 
introduced into target cells (for in vitro applications), 
or the nucleic acids of interest can be introduced directly 

25 or indirectly into a host by the transfer of transformed 
cells into a host. 

In contrast to exogenous nucleic acids, the phrase 
"endogenous nucleic acids" or "endogenous genes" refers to 
nucleic acids naturally found at levels sufficient to 
30 provide a function in the particular cell where 
transcription is desired. 
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Exogenous nucleic acids contemplated for use in the 
practice of the present invention include wild type and/or 
therapeutic nucleic acids. 



"Wild type" genes are those that are native to cells 
5 of a particular type. Exemplary wild type nucleic acids 
are genes which encode products: 

the substantial absence of which leads to the 
occurrence of a non-normal state in a host; or 

a substantial excess of which leads to the 
10 occurrence of a non-normal state in a host, 

Such genes may not be expressed in biologically significant 
levels or may be undesirably overexpressed, respectively. 
Thus, for example, while a synthetic or natural gene coding 
for human insulin would be exogenous genetic material to a 
15 yeast cell (since yeast cells do not naturally contain 
insulin genes) , a human insulin gene inserted into a human 
skin fibroblast cell would be a wild type gene with respect 
to the fibroblast since human skin fibroblasts contain 
genetic material encoding human insulin, although human 
2 0 skin fibroblasts do not express human insulin in 
biologically significant levels. 

Therapeutic nucleic acids contemplated for use in the 
practice of the present invention include those which: 

encode products which are toxic to the cells in 
25 which they are expressed; or 

encode products which impart a beneficial 
property to a host; or 
, w ^ J;hp§€ . . ^i c^^ra^pr^be v ^ ^ycj.eic acj^s^ ^^hici.^.j^julat^. 
transcription and/or translation of endogenous genes. 



30 As employed herein, the phrase "therapeutic nucleic 

acids" refers to nucleic acids which impart a beneficial 
function to the host in which such nucleic acids are 
transcribed. Therapeutic nucleic acids are those that are 
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not naturally found in host cells. For example, synthetic 
or natural nucleic acids coding for wild type human insulin 
would be therapeutic when inserted into a skin fibroblast 
cell so as to be expressed in a human host, where the human 
5 host is not otherwise capable of expressing functionally 
active human insulin in biologically significant levels. 
Further examples of therapeutic nucleic acids include 
nucleic acids which transcribe antisense constructs used to 
suppress the expression of endogenous genes. Such 

10 antisense transcripts bind endogenous nucleic acid (mRNA or 
DNA) and effectively cancel out the expression of the gene. 
In accordance with the methods described herein, 
therapeutic nucleic acids are expressed at a level that 
provides a therapeutically effective amount of the 

15 corresponding therapeutic protein. 

Exogenous nucleic acids useful in the practice of the 
present invention include genes that encode biologically 
active proteins of interest, such as, e.g., secretory 
proteins that can be released from said cell ; enzymes that 

20 can metabolize a toxic substance to produce a non-toxic 
substance, or that metabolize an inactive substance to 
produce a useful substance; regulatory proteins; cell 
surface receptors; and the like. Useful genes include 
genes that encode blood clotting factors such as human 

25 factors VIII and IX; genes that encode hormones such as 
insulin, parathyroid hormone, luteinizing hormone releasing 
factor { LHRH ) , alpha and beta seminal inhibins, and human 
growth hormone; genes that encode proteins such as enzymes, 
t he. ^bs e ince„. o f . which^ ljsads^ 

3 0 state; genes encoding cytokines or lymphokines such as 
interferons, granulocytic macrophage colony stimulating 
factor (GM-CSF) , colony stimulating factor-1 (CSF-1) , tumor 
necrosis factor (TNF) , and erythropoietin (EPO) ; genes 
encoding inhibitor substances such as a lpha x - ant i trypsin, - 

35 genes or nucleotides which characterize specific conditions 
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or diseases (i.e., CAG repeat expansion and concomitant 
polyglutamine (polyQ) expression has been linked to a 
variety of neurodegenerative conditions including 
Huntington's disease, dentatorubro-pallidoluysian atrophy, 
5 spinocerebellar ataxias, and spiriobulbar muscular atrophy, 
and the like) ; genes encoding substances that function as 
drugs, e.g., genes encoding the diphtheria and cholera 
toxins; and the like. 

Additional nucleic acids contemplated for use in 
10 accordance with the present invention include genes which 
encode proteins present in dopaminergic neurons (useful, 
for example, for the treatment of Parkinson's disease), 
cholinergic neurons (useful, for example, for the treatment 
of Alzheimer's disease), hippocampal pyramidal neurons 
15 (also useful for the treatment of Alzheimer's disease), 
norepinephrine neurons (useful, for example, for the 
treatment of epilepsy) , spinal neurons (useful, for 
example, for the treatment of spinal injury), glutamatergic 
neurons (useful, for example, for the treatment of 
20 schizophrenia) , cortical neurons (useful, for example, for 
the treatment of stroke and brain injury) , motor and 
sensory neurons (useful, for example, for the treatment of 
amyotrophic lateral sclerosis), and the like. 

Typically, nucleic acid sequence information for 
25 proteins encoded by exogenous nucleic acid(s) contemplated 
for use employed herein can be located in one of many 
public access databases, e.g., GENBANK, EMBL, Swiss-Prot, 
^.^^jan^JPIR.^ or^.in^ r related^ j ourjial publicat ions^ T ^ us / those 
of skill in the art have access to sequence information for 
3 0 virtually all known genes. Those of skill in the art can 
either obtain the corresponding nucleic acid molecule 
directly from a public depository or the institution that 
published the sequence. Optionally, once the nucleic acid 
sequence encoding a desired protein has been ascertained, 
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the skilled artisan can employ routine methods, e.g., 
polymerase chain reaction (PCR) amplification, to isolate 
the desired nucleic acid molecule from the appropriate 
nucleic acid library. Thus, all known nucleic acids 
5 encoding proteins of interest are available for use in the 
methods and products described herein. 

Additional components which can optionally be 
incorporated into invention constructs include selectable 
markers. Selectable markers contemplated for use in the 

10 practice of the present invention include radiolabeled 
molecules, fluorescent molecules, ligands, enzymes, and the 
like. Preferable selectable markers are enzymes such as 
antibiotic resistance genes, genes which enable cells to 
process metabolic intermediaries, and the like. Exemplary 

15 antibiotic resistance genes include genes which impart 
tetracycline resistance, genes which impart ampicillin 
resistance, zeomycin resistance, neomycin resistance, 
hygromycin resistance, puromycin resistance, and the like. 
Genes which enable cells to process metabolic 

20 intermediaries include genes which permit cells to 
incorporate L-histidinol , genes encoding thymidine kinase, 
genes encoding xanthine-guanine phosphoribosyl transferase 
(gpt) # genes encoding dihydrof olate reductase, genes 
encoding asparagine synthetase, and the like. 

25 In accordance with a preferred embodiment of the 

present invention, the invention nucleic acid construct 
further comprises, in addition to* the receptor peptide, a 
np^^ammal i an r a^sac^t iv^gx^nQt^ r jnembgr^^ 
receptor superfamily and not normally present in the ceils 

3 0 of said host, and a compatible transact ivator responsive 
regulatory element not normally present in cells of said 
host. The transactivator responsive regulatory element 
controls transcription of the exogenous nucleic acid(s) or 
a second nucleic acid construct comprising a second 
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exogenous nucleic acid(s). Examples of transact ivator 
responsive regulatory elements include operators which are 
responsive to non-mammalian transact ivators which confer 
responsiveness to antibiotics. Exemplary operators 
5 contemplated for use in this aspect of the invention 
include the tetracycline-analog regulated operator, the TET 
operator, the Lac operator, and the like. Preferably, the 
transact ivator responsive regulatory elements employed in 
the practice of the present invention are operably linked 
10 to a suitable promoter for transcription of exogenous 
nucleic acid{s) proteins. 

Non-mammalian transactivators, other than members of 
the nuclear receptor superfamily, contemplated for use in 
the practice of the present invention function in the 

15 absence of exogenous dimer partner. Examples of 
transactivators that typically function in the absence of 
exogenous dimer partner are tetracycline-controlled 
transactivators, Vpl6-Lac fusion transactivators, and the 
like. When contained as part of a transactivating 

20 construct, the transactivator can be positioned at any 
convenient site within the construct, i.e., at the carboxy 
terminus or the amino terminus of the transactivating 
construct. In preferred embodiments of present invention, 
the transactivator is positioned at the amino terminus of 

25 the transactivating construct (Figure 1C) . 

Preferably, the non-mammalian transactivator confers 
responsiveness to antibiotics . An example of a non- 

antibiotics, as contemplated for use in the practice of the 
3 0 present invention is the tetracycline-controlled 
transactivator. The tetracycline inducible system is well- 
known in the art (see, e.g, Gossen et al . (1992) PNAS 89, 
5547-5551; Gossen et al , (1993) TIBS 18, 471-475; Furth et 
al. (1994) PNAS 91, 9302-9306; Shockett et al . (1995) PNAS 
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92, 6522-6526,- and Hoshimaru et al . (1996) PNAS 93(4): 1518- 
1523) . Other examples of non- mammalian transact ivators 
well known in the art include the IPTG inducible system 
based on a VP16-Lac repressor fusion which functions 
5 through lac operator sequences inserted into heterologous 
promoters {see, e.g., Bairn et al . (1991) PNAS 88:5072- 
5076) . 

In addition to the receptors/transactivators set forth 
above, those of skill in the art recognize that other 

10 transactivators can be used herein, e.g., homeobox 
proteins, zinc finger proteins, hormone receptors, helix- 
turn-helix proteins, helix-loop-helix proteins, basic-Zip 
proteins (bZip) , P-ribbon factors, and the like. See, for 
example, Harrison, S., "A Structural Taxonomy of DNA- 

15 binding Domains," Nature, 353:715-719. Homeobox DNA- 
binding proteins suitable for use herein include, for 
example, HOX, STF-1 (Leonard et al . r (1993) Mol . Endo . , 
7:1275-1283), Antp, Mat a-2, INV, and the like. See, also, 
Scott et al. (1989) Biochem. Biophys. Acta, 989:25-48. It 

2 0 has been found that a fragment of 76 amino acids 
(corresponding to amino acids 14 0-215 described in Leonard 
et al., (1993) Mol . Endo., 7:1275-1283) containing the STF- 
1 homeodomain binds DNA as tightly as wild- type STF-1. 
Suitable zinc finger DNA-binding proteins for use herein 

25 include Zif268, GLI , XFin, and the like. See also, Klug 
and Rhodes (1987) Trends Biochem. Sci., 12:464; Jacobs and 
Michaels (1990) New Biol., 2:583; and Jacobs (1992) EMBO 
J. , 11:4507-4517. 

In a preferred embodiment of the present invention, 
30 expression cassettes are prepared by operably linking 
invention nucleic acid constructs to a suitable promoter 
for expression of the encoded receptor peptide. As used 
herein, the term "promoter" refers to a specific nucleotide 
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sequence recognized by RNA polymerase, the enzyme that 
initiates RNA synthesis. The promoter sequence is the site 
at which transcription can be specifically initiated under 
proper conditions. When nucleic acid constructs, 

5 operatively linked to a suitable promoter, is introduced 
into the cells of a suitable host, expression of the 
nucleic acid construct is conditionally or perpetually 
initiated. 

Promoters contemplated for use in the practice of the 
10 present invention include inducible (e.g., minimal CMV 
promoter, minimal TK promoter, modified MMLV LTR) , 
constitutive (e.g., chicken b-actin promoter, MMLV LTR 
(non- modified) , DHFR) , and/or tissue specific promoters. 

Inducible promoters contemplated for use in, the 
15 practice of the present invention comprise transcription 
regulatory regions that function maximally to promote 
transcription of mRNA under inducing conditions. Examples 
of suitable inducible promoters include DNA sequences 
corresponding to: the E. coli lac operator responsive to 
20 IPTG (see Nakamura et al., Cell, 18:1109-1117, 1979); the 
metallothionein promoter metal-regulatory-elements 
responsive to heavy-metal (e.g., zinc) induction (see Evans 
et al., U.S. Patent No. 4,870,009), the phage T7lac 
promoter responsive to IPTG (see Studier et al . , Meth. 
25 Enzymol., 185: 60-89, 1990; and U.S. #4,952,496), the heat- 
shock promoter; the TK minimal promoter; the CMV minimal 
promoter; a synthetic promoter; and the like. 

Exemplary constitutive promoters contemplated for use 
in the practice of the present invention include the CMV 
3 0 promoter, the SV40 promoter, the DHFR promoter, the mouse 
mammary tumor virus (MMTV) steroid- inducible promoter, 
Moloney murine leukemia virus (MMLV) promoter, elongation 
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factor lor (EFla) promoter, albumin promoter, APO Al 
promoter, cyclic AMP dependent kinase II (CaMKII) promoter, 
keratin promoter, CD3 promoter, immunoglobulin light or 
heavy chain promoters, neurof iliment promoter, neuron 
5 specific enolase promoter, L7 promoter, CD2 promoter, 
myosin light chain kinase promoter, HOX gene promoter, 
thymidine kinase (TK) promoter, RNA Pol II promoter, MYOD 
promoter, MYF5 promoter, phophoglycerokinase (PGK) 
promoter, Stfl promoter, Low Density Lipoprotein (LDL) 
10 promoter, chicken b-actin promoter (used in conjunction 
with ecdysone response element) and the like. 

As readily understood by those of skill in the art, 
the term "tissue specific" refers to the substantially 
exclusive initiation of transcription in the tissue from 

15 which a particular promoter, which drives expression of a 
given gene, is derived (e.g., expressed only in T-cells, 
endothelial cells, smooth muscle cells, and the like) . 
Exemplary tissue specific promoters contemplated for use in 
the practice of the present invention include the GH 

20 promoter, the NSE promoter, the GFAP promoter, 
neurotransmitter promoters (e.g., tyrosine hydroxylase, TH, 
choline acetyltransf erase, ChAT, and the like) , promoters 
for neurotropic factors (e.g., a nerve growth factor 
promoter, NT-3, BDNF promoters, and the like), and so on. 

25 In yet another embodiment of the present invention, 

there are provided constructs comprising a promoter, a 
tetracycline-controlled transactivator , a VP16 activation 

_„ a doma^D.^ and jDombyx^ ,xngri -^derive 

region and ligand binding domain encoding sequence, wherein 

30 the components of the construct are operatively associated 
with the other components of the construct. 

As used herein, the phrase "operatively associated 
with" refers to the functional relationship of DNA with 
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regulatory and effector sequences of nucleotides, such as 
promoters, enhancers, transcriptional and translational 
stop sites, and other signal sequences. For example, 
operative linkage of DNA to a promoter refers to the 
5 physical and functional relationship between the DNA and 
promoter such that the transcription of such DNA is 
initiated from the promoter by an RNA polymerase that 
specifically recognizes, binds to and transcribes the DNA. 

In yet another embodiment of the present invention, 
10 there are provided constructs comprising nucleic acid(s) 
encoding a a VP16 activation domain operatively associated 
with nucleic acid encoding a DNA binding domain and bombyx 
mori -derived hinge region and ligand binding domain. 

In accordance with another embodiment of the present 
15 invention, there are provided gene transfer vectors useful 
for the introduction of invention constructs into suitable 
host cells. Such gene transfer vectors preferably comprise 
a first reporter under the control of a regulatory element, 
a second reporter under the control of an operator which is 
20 responsive to a ligand-mediated receptor which confers 
responsiveness to antibiotics, and a construct comprising 
a promoter and nucleic acid encoding a VP16 activation 
domain, a DNA binding domain and bombyx /nori- derived hinge 
region and ligand binding domain, optionally, a 
25 tetracycline-controlled transactivator . The number of 
copies of regulatory elements can readily be varied by 
those of skill in the art. For example, transcription 

of a particular regulatory element, preferably 2 up to 
30 about 25 copies, more preferably 3 up to about 10-15 
copies, with about 4-6 copies being especially preferred. 

Gene transfer vectors (also referred to as "expression 
vectors") contemplated for use herein are recombinant 
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nucleic acid molecules that are used to transport invention 
nucleic acid constructs into host cells for expression 
and/or replication thereof. Expression vectors may be 
either circular or linear, and are capable of incorporating 
5 a variety of nucleic acid constructs therein. Expression 
vectors typically come in the form of a plasmid that, upon 
introduction into an appropriate host cell, results in 
expression of the inserted nucleic acid. 

Suitable expression vectors for use herein include a 
10 recombinant DNA or RNA construct (s) , such as plasmids, 
phage, recombinant virus or other vectors that, upon 
introduction into an appropriate host cell, result (s) in 
expression of the inserted DNA. Appropriate expression 
vectors are well known to those of skill in the art and 
15 include those that are replicable in eukaryotic cells 
and/or prokaryotic cells and those that remain episomal or 
those which integrate into the host cell genome. 

As used herein, the phrase "transcription regulatory 
region" refers to that portion of a nucleic acid or gene 

20 construct that controls the initiation of mRNA 
transcription. Regulatory regions contemplated for use 
herein, in the absence of the non- mammalian transactivator , 
typically comprise at least a minimal promoter in 
combination with a regulatory element responsive to the 

25 ligand/receptor peptide complex. A minimal promoter, when 
combined with a regulatory element functions to initiate 
mRNA transcription in response to a ligand/receptor peptide 
<?P m P 1 £ x „^_ „ Howe v e r \ v _ trans c r ipti or\ r _ t yp lea 1 1. yw 1 1 1 . not ^ oc cur , 
unless the required inducer (ligand therefor) is present. 

30 Conversely, when the non-mammalian transactivator, other 
than members of the nuclear receptor superfamily, is 
present in the host, the transactivator-responsive 
regulatory element will typically only induce transcription 
in the absence of ligand therefor. 
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Preferably, the transcription regulatory region 
further comprises a binding site for ubiquitous 
transcription factor (s) . Such binding sites are preferably 
positioned between the promoter and the regulatory element. 
5 Suitable ubiquitous transcription factors for use herein 
are well-known in the art and include, for example, Spl . 

Expression vectors suitable for use in the practice of 
the present invention are well known to those of skill in 
the art and include those that are replicable in eukaryotic 
10 cells and/or prokaryotic cells as well as those that remain 
episomal and those that integrate into the host cell 
genome. Expression vectors typically further contain other 
functionally important nucleic acid sequences encoding 
antibiotic resistance proteins, and the like. 

15 Exemplary eukaryotic expression vectors include 

eukaryotic constructs, such as the pSV-2 gpt system 
(Mulligan et al., (1979) Nature, 277:108-114); pBlueSkript 
(Stratagene, La Jolla, CA) , the expression cloning vector 
described by Genetics Institute {Science, (1985) 228:810- 

20 815), and the like. Each of these plasmid vectors are 
capable of promoting expression of the receptor peptide of 
interest . 

In a specific embodiment, a gene transfer vector 
contemplated for use herein is a viral vector, such as 
25 Adenovirus, adeno-associated virus, or herpes -simplex virus 
based vectors, and synthetic vectors for gene therapy, and 

50:1252-1268). Adenovirus and adeno-associated virus are 
extremely suitable as gene transfer vectors to produce 
3 0 receptor peptides. Alternatively, a gene transfer vector 
employed herein is a retroviral vector. Retroviral vectors 
are gene transfer plasmids that have an expression 
construct containing an exogenous nucleic acid residing 
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between two retroviral LTRs. Retroviral vectors typically 
contain appropriate packaging signals (e.g., a retroviral 
psi W packaging signal) , elements which enable invention 
construct integration into a host cell (e.g., a 5* and/or 
5 a 3 1 long terminal repeat (LTR) ) , and/or genes encoding 
proteins required for retroviral packaging (e.g., the pol 
gene, the gag gene and the env gene) , that enable the 
retroviral vector, RNA transcribed using the retroviral 
vector as a template, to be packaged into a viral virion in 
10 an appropriate packaging cell line (see, e.g., U.S. Patent 
4, 650,764) . 

Suitable retroviral vectors for use herein are 
described, for example, in U.S. Patents 5,399,346 and 
5,252,479; and in WIPO publications WO 92/07573, WO 

15 90/06997, WO 89/05345, WO 92/05266 and WO 92/14829, each of 
which is hereby incorporated herein by reference, in their 
entirety. These documents provide a description of methods 
for efficiently introducing nucleic acids into human cells 
using such retroviral vectors. Other retroviral vectors 

2 0 include, for example, mouse mammary tumor virus vectors 
(e.g., Shackleford et al. , (1988) PNAS, USA, 85:9655-9659), 
human immunodeficiency virus (e.g., Naldini et al . (1996) 
Science 272:165-320), and the like. 

Various procedures are also well-known in the art for 
25 providing helper cells which produce retroviral vector 
particles which are essentially free of replicating virus. 
See, for example, U.S. Patent 4,650,764; Miller, Human Gene 
-, w ^*^mr^^ 1 TSS 0) - t*"* "Markowi tzT ^ etr " te a lv; ~ ~&ouTnal^x>£ 

Virology, 61 (4) : 1120-1124 (1988); Watanabe, et al . , 
30 Molecular and Cellular Biology, 3 (12) : 2241-2249 (1983); 
Danos, etal., PNAS, 85:6460-6464 (1988); and Bosselman, et 
al., Molecular and Cellular Biology, 1 (5) : 1797-1806 (1987), 
which disclose procedures for producing viral vectors and 
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helper cells which minimize the chances for producing a 
viral vector which includes a replicating virus. 



Recombinant retroviruses suitable for carrying out the 
invention methods are produced employing well-known methods 
5 for producing retroviral virions. See, for example, U.S. 
Patent 4,650,764; Miller, Human Gene Therapy, 1:5-14 
(1990); Markowitz, et al., Journal of Virology, 61(4):1120- 
1124 (1988); Watanabe, et al., Molecular and Cellular 
Biology, 3 (12) :2241-2249 (1983); Danos , et al . , PNAS, 
10 85:6460-6464 (1988); and Bosselman, et al . , Molecular and 
Cellular Biology, 7 (5) : 1797-1806 (1987). 

Thus, in one embodiment, a recombinant retroviral 
vector can be utilized to express receptor peptide. 
Preferably, the retroviral vector will further comprise a 

15 regulatory element and exogenous nucleic acid under the 
control of the regulatory element. Optionally, the 
retroviral vector can express an antibiotic resistance gene 
(see Figure 1C) . A "covector" can also be utilized to 
provide a nucleic acid construct comprising the promoter, 

2 0 the regulatory element and exogenous nucleic acid and a 
second antibiotic resistance gene. The co-vector carrying 
exogenous nucleic acid also has LTRs modified to promote 
high-level expression of exogenous nucleic acid (s) only in 
the presence of the receptor peptide encoded by the 

2 5 recombinant retrovirus and exogenous ligand therefor. Co- 
infected primary mammalian cells can then be selected using 
both antibiotics, resulting in a cell population that is 

--•- -^ - ^ 

exogenous nucleic acid. 



3 0 The use of invention nucleic acid constructs allows 

the stable integration and expression of an exogenous 
nucleic acid(s) or transgene into a wide variety of 
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cultured cell types. In accordance with another embodiment 
of the present invention, there are provided recombinant 
cells containing invention nucleic acid constructs encoding 
receptor peptides as described herein, optionally further 
5 containing regulatory elements operatively associated with 
exogenous nucleic acid(s) . Alternatively, recombinant 
cells constructs comprising a regulatory element, such as 
the bombyx mori receptor response element, operatively 
associated with exogenous nucleic acids, optionally further 
10 containing invention receptor peptides. 

The amount of exogenous nucleic acid introduced into 
a host can be varied by those of skill in the art . For 
example, when a viral vector is employed to achieve gene 
transfer, the amount of nucleic acid introduced can be 
15 increased by increasing the amount of plaque forming units 
(PFU) of the viral vector. 

Suitable means for introducing (transducing) 
expression vectors containing invention nucleic acid 
constructs into host cells to produce transduced 

20 recombinant cells (i.e., cells containing recombinant 
heterologous nucleic acid) are well-known in the art (see, 
for review, Friedmann, (1989) Science, 244:1275-1281; 
Mulligan, (1993) Science, 260:926-932, each of which are 
incorporated herein by reference in their entirety) . 

25 Exemplary methods of transduction include, e.g., infection 
employing viral vectors (see, e.g., U.S. Patent 4,405,712 
and 4,650,764), calcium phosphate transfection (U.S. 

transfection, electroporation, lipofection (see, e.g., U.S. 
30 Patents 4,394,448 and 4,619,794), cytofection, particle 
bead bombardment, and the like. The transduced nucleic 
acid can optionally include sequences which allow for its 
extrachromosomal (i.e., episomal) maintenance, or the 
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transduced nucleic acid can be donor nucleic acid that 
integrates into the genome of the host. 



Exemplary eukaryotic cells suitable for introducing 
invention expression vectors include, e.g., CV-1 cells, P19 
5 cells and NT2/D1 cells (which are derived from human embryo 
carcinomas), ES cells (embryonic stem cells), COS cells, 
mouse L cells, Chinese hamster ovary (CHO) cells, primary 
human fibroblast cells, human embryonic kidney cells, 
African green monkey cells, HEK 293 (ATCC accession #CRL 

10 1573; U.S. Patent No. 5,024,93 9) , Ltk" cells (ATCC accession 
#CCL1.3), COS-7 cells (ATCC under accession #CRL 1651), 
DG44 cells (dhfr* CHO cells; see, e.g., Urlaub et al . (1986) 
Cell. Molec. Genet. 12: 555), cultured primary tissues, 
cultured tumor cells, neuronal progenitor or precursor 

15 cells, such as hcn/v-myc (a tetracycline sensitive 
retroviral vector selected by G418 resistance) , neuronal 
cells lines such as cerebellum derived neuronal precursors 
and PC12 cells, neurons, primary astrocytes, 
oligodendrocytes, and the like. Presently preferred cells 

20 include CV-1 and 293 cells. 

Invention nucleic acid constructs may be stably 
incorporated into cells or may be transiently expressed 
using methods known in the art. Cells are cultivated under 
growth conditions (as opposed to protein expression 
25 conditions) until a desired density is achieved. Stably 
transf ected mammalian cells may be prepared by transf ecting 
cells with an expression vector having a selectable marker 

dihydrofolate reductase, neomycin resistance, and the 
30 like) , and growing the transfected cells under conditions 
selective for cells expressing the marker gene. To prepare 
transient transf ectants , mammalian cells are transfected 
with a reporter gene (such as the E. coli S-galactosidase 
gene) to monitor transf ection efficiency. Selectable 
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marker genes are typically not included in the transient 
transf ections because the transf ectants are typically not 
grown under selective conditions, and are usually analyzed 
within a few days after transf ection. 

5 In accordance with a still further embodiment of the 

present invention, there are provided transgenic animals 
and methods for producing transgenic animals capable of 
prolonged and regulated expression of exogenous nucleic 
acid(s) , said method comprising introducing into early- 
10 stage embryos or stem cells: 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the 
control of a regulatory element; 

{ii) nucleic acid encoding a receptor peptide 
15 comprising a DNA binding domain, and the ligand 

binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in the 
cells of said host, wherein said receptor peptide 
20 activates said regulatory element in the absence 

of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain . 

As used herein, the phrase "transgenic animal" refers 
25 to an animal that contains one or more inheritable 
^.^-^pr^ess^p^ 

nucleic acid(s) under the transcription control of an 
operator and/or hormone response element as described 
herein. 

3 0 Methods of making transgenic animals using a 

particular nucleic acid construct are well-known in the 
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art. When preparing invention transgenic animals, it is 
preferred that two transgenic lines are generated. The 
first line will express, for example, a receptor peptide as 
described above (e.g., VbR) . Tissue specificity is 
5 conferred by the selection of a tissue-specific promoter 
(e.g., T-cell specific) that will direct expression of the 
receptor peptide to appropriate tissue. A second line 
contains a nucleic acid construct comprising a promoter and 
exogenous nucleic acid under the control of a regulatory 
10 element. Preferably, transgenic animals are produced by 
the transfection/inf ection of embryonic stem cells which 
are employed to produce invention transgenic animals 
employing methods known to those of skill in art. 



In a presently preferred embodiment, an invention 
15 transgenic animal contains one or more expression 
constructs containing nucleic acid encoding receptor 
peptide and exogenous nucleic acid under the transcription 
control of a regulatory element. Thus, with tissue 
specific expression of the receptor peptide as described 
2 0 above and timely hormone treatment, inducible gene 
expression can be achieved with spatial, dosage, and 
temporal specificity. 



In yet another embodiment of the present invention, 
there are provided methods of inducing the transcription of 
2 5 an exogenous nucleic acid(s) in a host containing: 

(i) a nucleic acid construct comprising a promoter 

control of a regulatory element; 



(ii) nucleic acid encoding a receptor peptide 
30 comprising a DNA binding domain, and the ligand 

binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
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superfamily which is not normally present in the 
cells of said host, wherein expression of said 
receptor peptide is under the control of an 
inducible promoter, wherein said receptor peptide 
5 activates said regulatory element in the absence 

of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain; and 



(iii) said ligand for said ligand binding domain, 

10 wherein said ligand is not normally present 

in the cells of said host; 

said method comprising subjecting said host to 
conditions suitable to induce expression of said receptor 
peptide . 



15 In accordance with yet another embodiment of the 

present invention, there are provided methods for the 
expression of recombinant products detrimental to a host 
organism, said method comprising: 

transforming suitable host cells with: 
20 (i) a nucleic acid construct comprising a promoter 

and exogenous nucleic acid(s) which express said 
recombinant product under the control of a 
regulatory element ; wherein said regulatory 
element is not normally present in the cells of 
25 said host, and 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
3 0 superfamily which is not normally present in the 

cells of said host, wherein said receptor peptide 



WO 99/02683 



PCT/US9S/14215 



44 

activates said regulatory element in the absence 
of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain, and 

5 growing said host cells to the desired level in the 

substantial absence of ligand for said receptor peptide; 
and 

inducing expression of said recombinant product by 
introducing into said host cells a ligand, which, in 
10 combination with said receptor peptide, binds to said 
regulatory element and activates transcription therefrom. 

Recombinant products detrimental to a host organism 
contemplated for expression in accordance with the present 
invention include any gene product that functions to confer 

15 a toxic effect on the organism. For example, inducible 
expression of a toxin such as the diptheroid toxin would 
allow for specific ablation of tissue (Ross et al . Genes 
and Development 7:1318-1324 (1993)). Moreover, the 
numerous gene products that are known to induce apoptosis 

20 in cells expressing such products are contemplated for use 
herein (see, e.g, Apoptosis, The Molecular Basis of Cell 
Death, Current Communications In Cell & Molecular Biology, 
Cold Spring Harbor Laboratory Press, 1991) . For example, 
high level expression of 79-amino acid polyglutamine tracts 

25 in cells results in apoptosis and rapid cell death. 

-^..^.^.^ w ^^i-i^^-TCcordanGe=«-wi?fch^-sfe*i'll^ * -another-^* emfeo^i-ment^~<z>£ v -the - 
present invention, there are provided nucleic acid 
constructs and cell lines which express a polyglutamine 
expression cassette (PQEC) under the control of an 
3 0 exogenous ligand to maximize obtaining stable 
transf ected/inf ected cultured cells which do not undergo 
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undesired apoptosis. Expression of polyglutamine tracts 
separately from the huntingtin protein facilitates 
identification of the toxic properties associated with the 
polyglutamine tracts themselves. In another embodiment of 
5 the present invention, there are provided methods for 
employing these cell lines to identify novel proteins that 
could target developing intracellular aggregates (IAs) and 
either block subsequent growth or facilitate or mark the 
IAs for degradation. 

10 In accordance with still another embodiment of the 

present invention, there are provided methods for 
modulating the transcription of nucleic acid(s) in an in 
vitro system, said method comprising administering to said 
system an amount of a ligand effective to modulate the 

15 transcription of said nucleic acid(s) ; wherein said ligand 
is not normally present in said cellular system; wherein 
said system comprises : 

(i) a nucleic acid construct comprising a promoter 
and said nucleic acid(s) under the control of a 

20 regulatory element; and 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 

25 superfamily which is not normally present in the 

cells of said host, wherein said receptor peptide 
-r .-«r<*~™. r^-v *^-r^-^ctiyafeeSr.-said^.egia-la=t ory, .elements in^.the^abssnce,, 
of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 

30 domain, 

In accordance with yet another embodiment of the 
present invention, there are provided methods for the 
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treatment of a host in need of gene therapy, said method 
comprising: 



introducing into cells of said host: 

(i) a nucleic acid construct comprising a promoter 
5 and said exogenous nucleic acid{s) under the 

control of a regulatory element; 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 

10 mammalian member of the nuclear receptor 

superfamily which is not normally present in the 
cells of said host, wherein expression of said 
receptor peptide is under the control of an 
inducible promoter, wherein said receptor peptide 

15 activates said regulatory element in the absence 

of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain . 

administering, to said host, ligand for said ligand 
20 binding domain. 



As used herein, the term "in vivo delivery" refers to 
delivery of biological materials by such routes of 
administration as oral, intravenous, subcutaneous, 
intraperitoneal , intrathecal , intramuscular , intracranial , 
25 inhalational, topical, transdermal, suppository (rectal), 
pessary (vaginal), and the like. 

As employed herein, the term "ligand" (or ligand 
precursor) refers to a non-steroidal substance or compound 
which, in its native state (or after conversion to its 
3 0 "active" form) , binds to the receptor peptide, thereby 
creating a ligand/receptor peptide complex, which in turn 
can bind an appropriate response element and activate 
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transcription therefrom. Ligands function to modulate 
transcription of nucleic acid(s) maintained under the 
control of a response element. In accordance with one 
aspect of the present invention, unless and until a 
5 suitable ligand is administered to the host, substantially 
no transcription of the desired exogenous nucleic acids 
occurs . 

Preferred ligands contemplated for use in the practice 
of the present invention are ligands characterized as not 
10 normally present in the cells of the host to be treated. 
Such ligands are referred to as being exogenous to the 
host. An example of a class of ligands not naturally 
present in mammalian systems are compounds referred to as 
hydrazines, preferably diacyl hydrazines. 

15 Hydrazines contemplated for use in the present 

invention include compounds which are readily available and 
are/or relatively inexpensive to manufacture. One such 
compound, tebuf enozide, is a non-steroidal ecdysone agonist 
which is used commercially as an insecticide. This 

20 compound specifically targets lepidopteran species, 
including bombyx mori . Tebuf enozide has undergone 
extensive testing in animal hosts and has proved to be of 
very low toxicity to mammals and other non-insect species. 

Exemplary hydrazines contemplated for use herein are 
25 mimics of the naturally occurring ecdysones, i.e., 
synthetic organic compounds which have transactivation 

r:::-:-r -activ.it y,^^ - ^arac teri stxc ^ol r ^ the ..-.-naturally,, ^..occ-Urriiig. 

ecdysones. Examples of such compounds, referred to herein 
as ecdysone mimics, include 1,2-diacyl hydrazines (e.g., 
30 tebuf enozide and others described in U.S. Patent Nos . 
5,424,333 and 5,354,762, the entire contents of each of 
which are hereby incorporated by reference herein) , 
N' -substituted-N, N 1 -di -substituted hydrazines (e.g., those 
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described in U.S. Patent No. 5,117,057, the entire contents 
of which are hereby incorporated by reference herein) , 
dibenzoylalkyl cyanohydrazines (e.g., those described in 
European Application No. 461,809, the entire contents of 
5 which are hereby incorporated by reference herein) , 
N-substituted-N-alkyl-N,N ' -diaroyl hydrazines (e.g., those 
described in U.S. Patent No. 5,225,443, the entire contents 
of which are hereby incorporated by reference herein) , 
N-substituted-N-acyl-N-alkyl, carbonyl hydrazines (e.g., 

10 those described in European Application No. 234,944, the 
entire contents of which are hereby incorporated by 
reference herein), N-aroyl -N ' -alkyl-N 1 -aroyl hydrazines 
(e.g., those described in U.S. Patent No. 4,985,461, the 
entire contents of which are hereby incorporated by 

15 reference herein), and the like. 

Since it has been previously reported that the above- 
described diacyl hydrazines are neither toxic, teratogenic, 
nor known to affect mammalian physiology, they are ideal 
candidates for use as inducers in cultured cells and 
20 transgenic mammals according to invention methods. 

Ligands are administered in a manner compatible with 
the route of administration, the dosage formulation, and in 
a therapeutically effective amount. The required dosage 
will vary with the particular treatment desired, the degree 
25 and duration of therapeutic effect desired, the judgment of 
the practitioner, as well as properties peculiar to each 
individual . Moreover, suitable dosage ranges for systemic 
.^__ t .,a^pJ^^ 

anticipated that dosages between about 10 micrograms and 
30 about 1 milligram per kilogram of body weight per day will 
be used for therapeutic treatment . 

An effective amount of ligand contemplated for use in 
the practice of the present invention is the amount of 
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ligand (e.g., diacyl hydrazine (s) ) required to achieve the 
desired level of transcription and/or translation of 
exogenous nucleic acid. A therapeutically effective amount 
is typically an amount of a ligand or ligand precursor 
5 that, when administered in a physiologically acceptable 
composition, is sufficient to achieve a plasma 
concentration of the transcribed or expressed nucleic acid 
product from about 0.1 pg/ml to about 100 /ig/ml, preferably 
from about 1.0 fxg/ml to about 50 fig/ml, more preferably at 
10 least about 2 /xg/ml and usually 5 to 10 /-tg/ml . 

Ligand can be administered in a variety of ways, as 
are well-known in the art, i.e., by any means that produces 
contact between ligand and receptor peptide. For example, 
such ligands can be administered topically, orally, 

15 intravenously, intraperitoneally, intravascularly , and the 
like. The administration can be by any conventional means 
available for use in conjunction with pharmaceuticals, 
e.g., by intravenous injection; either as individual 
therapeutically active ingredients or in a combination with 

20 other therapeutically active ingredients. Ligands 
contemplated for use in the practice of the present 
invention can be administered alone, but are generally 
administered with a pharmaceutical carrier selected on the 
basis of the chosen route of administration and standard 

25 pharmaceutical practice. 

Therapeutic compositions containing suitable ligand 
are preferably administered intravenously, as by injection 

used in reference to a therapeutic composition of the 
3 0 present invention, refers to a quantity of ligand suitable 
as unitary dosage for the subject, each unit containing a 
predetermined quantity of active material calculated to 
produce the desired therapeutic effect in association with 
the required diluent, i.e., carrier, or vehicle. It may be 
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particularly advantageous to administer such compounds in 
depot or long- lasting form as discussed hereinafter. 

Suitable regimes for initial administration and 
. booster shots are variable, but are typified by an initial 
5 administration followed by repeated doses at one or more 
intervals by a subsequent injection or other 
administration. Alternatively, continuous intravenous 
infusion sufficient to maintain concentrations in the blood 
in the ranges specified for in vivo therapies are 
10 contemplated. 

In accordance with still another embodiment of the 
present invention, there are provided methods for the 
treatment of a host in need of gene therapy, said method 
comprising : 



15 introducing into cells obtained from said host: 

(i) a nucleic acid construct comprising a promoter 
and exogenous nucleic acid(s) under the control 
of a regulatory element ; and 

(ii) nucleic acid encoding a receptor peptide 
20 comprising a DNA binding domain, and the ligand 

binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in the 
cells of said host, wherein said receptor peptide 
25 activates said regulatory element in the absence 

presence of a ligand for said ligand binding 
domain ; and 



to provide modified cells; 
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reintroducing the modified cells into said host; and 
administering, to said host, ligand for said ligand 
binding domain. 

In accordance with this embodiment of the present 
5 invention, the exogenous nucleic acid is introduced 
directly into cells obtained from a donor (host or separate 
donor) . Such cells are then implanted within the host. In 
a presently preferred embodiment, the transplanted cells 
are autologous with respect to the host . Autologous means 
10 that the donor and recipient of the cells are one and the 
same . 

The concept of gene replacement therapy for humans 
involves the introduction of functionally active "wild 
type" or "therapeutic" nucleic acids into the somatic cells 
15 of an affected host to correct a gene defect or deficiency. 
However, in order for gene replacement therapy to be 
effective, it must be possible to control the time and 
location at which gene expression occurs. 

Genes that encode useful "gene therapy" proteins that 
20 are not normally transported outside the cell can be used 
in the invention if such genes are "functionally appended" 
to a signal sequence that can "transport" the encoded 
product across the cell membrane. A variety of such signal 
sequences are known and can be used by those skilled in the 
25 art without undue experimentation. 

by reference to the following non-limiting examples. 
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Example 1 

Construction of Lepidopteran- derived fusion receptor 

peptide 

The VP16 t -domain (activation domain) is fused in 
5 frame onto bombyx mori- derived nuclear receptor (bR) at an 
internal Mlul site near the N- terminus (VbR, Figure 1A) . 
An AscI site immediately downstream of the ATG start codon 
allows mult imerizat ion of VP16 domains for VbRs with 
multiple VP16 activation domains. Note that unlike 

10 Drosophila ecdysone receptor (dEcR) , bR has virtually no 
C-terminal domain downstream of the LBD. Several amino 
acids of the hormone binding domain are replaced with 
sequences from the retinoic acid receptor or the thyroid 
hormone receptor {VbRRA and VbRTR, respectively) (Figure 

15 IB) . These mutant variations have properties slightly 
different from VbR, and in addition have silent mutations 
that eliminate Bell and BstBl sites in VbR. These are 
useful in the construction of many VbR-based retroviral 
constructs. 

20 Example 2 

Construction of multifunctional regulatory proteins 

The small size and simplicity of the VbR system lends 
itself to the development of multifunctional regulatory 
proteins which are functional on both hormone receptor and 

25 TTA responsive promoters. VbR fusion to the tetracycline 
^o.^^,rran^a.ct^ y-a.tpr^B suits . in ^bytai&PJWfc^i^ 

Tebufenozide/Tetracycline Modulated Transact ivator; see 
Figure 1C) . This protein functions jointly as a 
ligand-modulated regulator of gene expression from both 

30 tetO and EcRE (ecdysone response element) -containing 
promoters either separately or simultaneously. This 
compact protein, encoded in approximately 3 kb, confers 
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tetracycline operators (tetOs) in the absence of 
tetracycline/analogs * In the presence of tet analogs, the 
tetO-binding function of TTMT is blocked, deactivating 
5 responsive promoters. The VbR half of the chimeric protein 
is constitutively inactive on EcRE-containing promoters, 
but may be stimulated to transactivate responsive promoters 
to a high level by the addition of muristerone A (murA) or 
tebufenozide (teb) . Through such a dual system, two 

10 separate responsive promoters may be simultaneously 
regulated by two separate ligands through a single protein. 
Additional variants using the reverse -Tet- transact iva tor, 
other VbR variants, novel response elements, and other 
regulatory proteins, will presumably result in further 

15 customized variations on this same theme. 

A single transgenic animal generated with the TTMT 
protein would be simultaneously responsive to transgene 
regulation of two separate transgenes by two separate 
ligands. Further, individual promoters can be produced 
20 that respond to both halves of the protein, allowing the 
transgene to be regulated by both ligands, at the 
discretion of the investigator. 

Example 3 
Transient transfection assay 

25 Transient transfection experiments are carried out in 

CV-1 cells or 293 cells using modified drosophila ecdysone 

with or without dimer partners. Luc activity is examined 
40 hours after transfection and stimulation with ligand 
30 (Figure 2A) . Both receptors are assayed in CV-1 cells with 
1 /aM MurA and- no exogenous dimer partner. Note that bR 
responds/ while dEcR does not. Both receptors are also 
assayed in the presence of 5 /xg/ml tebufenozide (Figure 
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2B) . Note that VbR responds with greater than 2 00 -fold 
induction while drosophila EcR does not respond at all, and 
in fact, decreases slightly. Both receptors are also 
assayed with and without dimer partners. VbR is stimulated 
5 with 5 /xg/ml tebuf enozide and VdEcR is stimulated with 1 
MurA. Note that RXR and Usp are necessary for VdEcR while 
VbR is constitutively activated by the presence of 
exogenous dimer partner and is inhibited by ligand 
addition. VbR, with no exogenous dimer partner, has an 
10 extremely low base-line which is stimulated 200-300 fold by 
ligand. Note also from this experiment that VdEcR tends to 
have a much higher baseline than VbR without dimer partner. 



A side-by side comparison of 2.5 fxg/ml teb and 1 fxM 
MurA on VbR show both ligands are effective at stimulating 

15 VbR. Tebufenozide is administered to cells co-transf ected 
with nucleic acids encoding receptors: LINX (which encodes 
the tet-transactivator (TTA) ) , VbR, TTMT (which encodes the 
TTA-VbR fusion protein, and TTMT-2V which is like TTMT, but 
with two VP16 t-domains) to assay for EcRE-Luc activity 

20 (Figure 2C) . Note that teb stimulates VbR, and the TTMT 
fusions, but has no action on TTA (LINX) . The same 
experiment as that described for 2C, only with a TetO-Luc 
reporter, which responds only to TTA or fusion proteins is 
summarized in Figure 2D. Doxycycline (dox) acts to block 

25 constitutive activity of TTA. Note that in this 
experiment, TTMT and TTMT- 2 V work better than TTA only 
(LINX) to stimulate the TetO-luc promoter/reporter. They 

.. w .^ar&^alsQ^ , ^„,,e^p : erimen t^.gn XJfcJU- 

cells transf ected with TTMT and two reporters (6TetO-luc 

3 0 and 4-EcRE-ftgalactosidase) shows both promoters could be 
simultaneously influenced by the presence or absence of 
both ligands (Figure 2E) . 
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Example 4 

Liaand mediated transgene regulation 

Transgene regulation by teb or MurA in stably infected 
rat fibroblasts is assayed utilizing recombinant 
5 retroviral constructs as the reporter, and VbR encoding 
retroviral vectors to provide the receptor. Bulk infected 
selected populations stimulated with 2.5 /xg/ml teb or 1 jiM 
MurA indicate b-gal or tyrosine hydroxylase infected cells. 
Quantitative analysis of an individual VbR-luc clonal 
10 fibroblast line, and a VbR-GH clonal line show that 0.25 
fig/ml teb is as effective at stimulating full activity as 
higher levels (Figure 3) . 

Example 5 

Single-plasmid retroviral constructs 

15 The plasmid, pBO, utilizes a responsive internal 

promoter to regulate transgene expression and also contains 
a VbR-responsive 3' LTR to autoregulate the VbR receptor 
itself (Figure 4) . The plasmid, pNA, is constructing using 
a 3 ' -responsive LTR to regulate transgene expression. pBO 

20 utilizes an EcR responsive internal promoter and a TTA 
responsive 3 f LTR. Both are host to regulation by the TTMT 
protein, simultaneously. The purpose of this design is to 
provide chronic stimulation of the duplicated 5 ' LTR along 
with regulated expression of the internal EcR responsive 

25 promoter. By removing all of the native promoting 
sequences from the 6To LTR with the exception of the 

Example 6 
Construction of plasmid vectors 

30 VbR and TTMT systems including constitutive and 

inducible promoters are integrated into a modular plasmid 
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vector known as "pBW" (Figure 5) . The purpose of pBW is to 
integrate all of the elements of bombyx receptor- or 
tetracycline-analog responsiveness into a single plasraid 
vector destined for use in producing transgenic animals by 
5 methods such as pronuclear injection. pBW is to facilitate 
construction of transgenic animals or stably transfected 
cells using VbR variants. The pBW design simplifies the 
insertion of responsive promoters, transgenes, constitutive 
promoters, and the various VbRs, TTMTs , and related 
10 regulatory proteins. With pBW / the transgene regulatory 
properties of generated animals should be more predictable 
and reproducible from founder to founder than multi-vector 
systems . 



There are two major components of this system: the 
15 first is a tissue-specific expression cassette to produce 
the receptor and/or antibiotic resistance genes. The 
second is an expression cassette with a responsive promoter 
and a transgene. Polyadenylation signals flank all of the 
expression cassettes to provide efficient polyadenylation 
20 (p(A)) of transcribed RNAs. The use of rare restriction 
endonuclease sites within strategic locations and 
polylinkers of pBW will ease cloning of transgenes . An 
adjunct shuttle vector derived from pBSK (Stratagene) 
called SKSP contains rare sites flanking the polylinker and 
25 compatible with pBW. pBW also shares a number of sites 
with pBO to further simplify construction. pBW is 
constructed within the plasmid pcDNA3 (Invitrogen) . 



Example .7., . 
Regulating transgene 
30 expression in target cells in vitro 



Individual desired elements can be inserted 
sequentially into a recombinant retrovirus. Recombinant 
retrovirus starts as a polylinker composed of 5* Notl-Mlul- 
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Nrul-EcoRI-Ascl-Pmll-BstBl'BamKI-Hindlll-Hpal-Clal-Nsil- 
Kpnl 3' sites. The retroviral V gin (see Adam and Miller 
in J. Virol. 62:3802 (1988) and Barklis et al . , in Cell 
47:391 (1986)) is inserted into sites of the recombinant 
5 retroviral polylinker. The internal CMV promoter is 
inserted BainHI-Clal into the BamHI-Clal sites of the 
evolving recombinant retroviral vector. All LTRs destined 
for insertion into the 5' location are produced by low- 
cycle/high- fidelity PCR production and end primers with 

10 compatible Notl-Mlul sites for insertion into the Notl-Mlul 
sites. 3' LTRs are generated with Clal-Nsil compatible 
ends for insertion into these sites. The R region of the 
3' LTR is included in the downstream primer sequence. 
Transgenes, receptor cDNAs , or selectable marker genes are 

15 inserted into the remaining polylinker from the EcoRI site 
through the BamHI site, or the remaining polylinker after 
the internal promoters (Hindi I I -Clal) . 

The resulting vectors are characterized by restriction 
digests and mapping, amplified by large-scale plasmid 

20 preparation, and prepared for transfection into packaging 
cells by standard methods. Transient retroviral production 
in 2 93 and 293T cells has been previously described (see 
Pear et al . , in PNAS 90:8392 (1993)). Forty-eight hours 
after transient retroviral production, the conditioned 

25 media is removed, filtered through 0.45 mm filters, and 
frozen at -70°C until use. 10-cm dishes of primary- cultured 
Fisher rat abdominal fibroblasts (Rosenberg et al . , Science 
242:1575 (1988)) at approximately 50% density are infected 
with 1/10 volume of virus -containing media and 8 ycq 

30 polybrene/ml for 48 hours. Infected cells are then 
selected under the appropriate antibiotic until non- 
infected cells are cleared from the population 
(approximately 10 days) . Resistant colonies are 

trypsinized, pooled and passaged. The resulting population 
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is then infected with the second virus, and reselected 
again in media containing both antibiotics. The infection 
and selection process for the second round of viral 
infection proceeds as described for the first round. The 
5 final population of doubly resistant primary fibroblasts 
are then pooled and passaged to 6 or 24 -well Costar plates 
for assay in vitro. 

The short (24 hour) course is performed in 24 -well 
Costar dishes with cells at approximately 80% confluency. 

10 5 fil of ligand (final concentration of 1 /iM) or vehicle 
(20% EtOH) is added to the 1 ml of media in each well. 
Eight hours after first ligand administration, the cells 
are all washed in PBS and the media is replaced. 
galactosidase histochemical reaction is performed on 1*5% 

15 glutaraldehyde-f ixed cells essentially as described by 
Shimohama et al . in Brain Res Mol Brain Res 5:271 (1989) in 
a 3 7°C environment for 2 hours. The 4 -day time course is 
performed in an identical fashion, except that the cell 
population is plated at only 20% confluency at the start of 

20 the experiments (and grow to near -total confluency by the 
end of the 4 day period) and neither ligand nor media is 
changed or replaced. 

For long-term culture experiments, MMBG fibroblasts 
are plated in triplicate at -high density in 24 -well plates. 

25 Within 48-72 hours of plating, when the cells reach 100% 
confluency within the wells, 1 jiM ligand or vehicle is 
added to one group of plates, while others are left 
compl.eJi&l^utt.tx.eat .ed.. .- ^The^e LI s. -&r&Jt hejwpl ac^cU-ia- ,a-* 3*7 
10% C0 2 environment for the next 25 days without any media 

30 changes. On day 23, the plates that had received neither 
vehicle nor ligand on day 1 are removed and half of the 
wells are given vehicle and the other half are given 1 
ligand for 4 0 hours. 
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After 25 days of nutrient deprivation and contact 
inhibition, all of the plates are processed for Pgal 
histochemical staining as above. 



MMGH fibroblasts are produced by infecting a 
5 fibroblast line with the recombinant retroviral vectors 
containing desired elements. Cells are selected under both 
antibiotics as described previously. After preliminary 
examination of ligand- induced hGH production in the bulk 
population, the MMGH population is plated at high density 

10 in 6 -well Costar plates. Forty-eight hours after the 
initial plating and when the cells are essentially 
completely growth inhibited by contact, the medium of all 
plates is replaced with DME containing 2% FBS . Under these 
conditions, primary rat fibroblasts stop dividing even if 

15 they are not confluent, and settle out into a distended 
morphology with prominent nuclei characteristic of severely 
growth-arrested fibroblasts. Six days after 

acclimatization to this culture environment, 1 ml of medium 
is harvested from each well and plate of the experiment and 

2 0 stored frozen at -20°C. The remaining unused media is 
discarded at the same time of harvest and replaced with 
fresh DME-2% FBS. Groups of wells are treated or not with 
1 fiM ligand. Hormonally treated wells are washed with 
medium to remove as much residual ligand as possible prior 

2 5 to medium replacement when ligand treatment is 

discontinued. This routine is repeated daily for the 21- 
day extent of the experiment . 

^ ^^,M&££^ jcq 11 set e.4 . and ^ t 

frozen at -20°C, they are simultaneously processed for the 

3 0 presence of the hGH protein by ELISA (Boehringer Mannheim) 

following the protocol recommended by the manufacturer. 
ELISA data are quantitated on an MR700 microplate reader 
(Dynatech Laboratories, Chantilly, VA) and compared to a 
standard curve generated using purified hGH to determine 
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picogram GH amounts. Control wells are TH (tyrosine 
hydroxylase) producing fibroblasts, but display all of the 
characteristics of the recombinant retrovirus except for 
the hGH transgene and secreted protein. 

5 Example 8 

Expression Vectors and Transient Transfecti on Assays 

All receptors were subcloned into vector L.NCX (A.D. 
Miller, GenBanlc accession no. M28247, with an extended 
polylinker) for use in transient transfection assays. VBR 

10 (or CVBR) when referring to the retrovirus) was produced by 
insertion of VP16 (14) sequences with a synthetic ATG start 
codon into the amino- terminal region of BE up to, and 
in- frame with, the Mlul site (corresponding to amino acid 
26). VP1 6 primers: (5') 5 ' -GAGAGAAGCTTATGGCGCGCCCGACCGATG 

15 ( SEQ ID NO : 5 ) and ( 3 ' ) 5 ' - CACACACGCGTGTACTCGTCAATTCCAAG 
(SEQ ID NO: 6) . VDE (or CVDE) was produced in a similar 
fashion by fusing VP16 sequences in- frame at a novel Ncol 
site corresponding to amino acid 68. The downstream primer 
was made Ncol compatible. Plasmid template (100 ng) , each 

20 primer (500 ng) , and reaction conditions outlined by the 
manufacturer for Pwo (Boehringer Mannheim) high fidelity 
polymerase, were used with a program of 1 min at 94 °C, 
1 min at 45°C, and 1 min at 72°C for 20 cycles for 
production of all PGR products used herein. LNCX-Usp and 

25 RXR were produced from KcoRI fragments encoding this 
complete cDNAs , filling with the Klenow fragment of DNA 
polymerase, and inserting the fragments into the Hpal site 

endonuclease digestion. Transient transfection in CV-l and 
30 293 cell was performed by using standard methods (Sambrook 
et al (1989) Molecular Cloning: A Laboratory Manual 2 nd 
edition) in triplicate in 24 -well plates by calcium 
phosphate coprecipitation of 100-ng receptor (s) , reporter 
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plasmid E4-luc, and pCHHO as an internal control. 
Briefly, E4-luc is four tandem EcREs inserted upstream of 
a thymidine kinase gene minimal promoter directing 
luciferase expression EcRE oligonucleotides were as 
5 described (Thomas et al . (1993) Nature 362:471-475) with 
BamHI-Bg/ll compatible ends, Ligand (1 /xM) was added at 
the time of transf ection, and 40 h later cells were lysed 
and extracts were used for p-galactosidase assay and 
measurement of luciferase activity in an analystical 
10 bioluminescence photometer. 

CVDEiR and MS vectors used only as retroviruses in 
production of stable cell lines were produced as follows: 
The VDEiR expression cassette was constructed by modifying 
the ATG start codon of human RXRa to contain an overlapping 
15 BstXI site for fusion into the ATG start codon of the 
0.8-kb EMCV IRES (Jang et al (1989) J. Virol 63:1651-1660 
and Hoshimaru et al (1996) PNAS 93:1518-1523) sequence. 
The IRES-RXR cassette was then subcloned downstream of VDE 
in LNCX-ofVDE (CofVDE) , a VDE variant vector with lower 

2 0 basal transactivation levels in superphysiological RXR 

environments, to produce CVDEiR. The MS retroviral 
reporter was produced by removing the Moloney murine 
leukemia virus enhancer core from the 3' long terminal 
repeat (LTR) with Whel-Xbal digestion and replacing this 
25 sequence with four tandem EcREs , resulting in a reporter 
analogous to the E4-luc reporter plasmid. 

Example 9 

Gel mobility shift analysis was performed by in vitro 

3 0 translation of receptor ORPs subcloned into pBSK 

(Stratagene) , pGEM-3 (Promega) , and PSL301 (Invitrogen) in 
the presence of [ 3S S] Met by using T3/T7 TNT (Promega) 
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transcript ion/ translation and following the manufacturer's 
protocol. In vitro translated proteins were qualitatively 
examined by 5% SDS/PAGE with protocols as described 
elsewhere (Sambrook et al (1989) Molecular Cloning: A 
5 Laboratory Manual 2 nd edition) and quantified by 
Phosphor Imager (Molecular Dynamics) exposures of dried 
gels. Amounts of proteins used in gel shift assays were 
normalized by using quantitative data and correction for 
predicted Met residues in individual constructs. 

10 Double -stranded EcRE probes corresponding to response 
elements described above were labeled by filling of the 
Klenow fragment of DNA polymerase with [ 32 P] dCTP and 
unlabeled dGAT by standard methods. Reaction conditions 
for protein-probe interaction and gel electrophoresis were 

15 essentially as described by Yao et al . ((1992) Cell 71:63- 
72) except, to facilitate comparison between samples, 
reaction mixtures (including dimer partners) and probe) 
were prepared as a mixture and distributed equally to 
individual tubes with receptor proteins. The reactions 

20 were allowed to proceed at 23°C for 5 min. at which time 
ligand or vehicle was added and the reaction allowed to 
continue for an additional 20 min. 



Example 10 

Construction of bombvx mori receptor (BE) - drosophila 
25 melanocraster ecdvsone receptor (DE) Chimeri c Receptors 

Chimeric receptors were produced by taking advantage 
of conserved DNA sequences and sites within the receptors 
_ ^. w ^^^,:.,.v ian d«^y=:' n irrt r oduc ing 7 ~ oompa t i-bl^ ^s rtes ^by— low-- cycleT^' high — 
fidelity PCR (described above) of bombyx mori receptor 
30 (BE) and drosophila melanogaster ecdysone receptor (DE) 
templates to produce fusion proteins. DEBE-A/B was 
produced by PCR and replaced the BE /AB domain (amino acids 
1-197, as set forth in SEQ ID NO:3) with amino acids 1-255 
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of DE introducing a novel Apal restriction site. DEBE-C 
replaced the A/B and C domains (amino acids 1-273, as set 
forth in SEQ ID NO: 3) of BE with amino acids 1-331 of the 
DE protein by introduction of a compatible Eagl site into 
5 BE at the fusion junction. DEBE-D replaced BE from amino 
acids 1-3 63, as set forth in SEQ ID NO: 3, with amino acids 
1-430 of DE by Kpnl digest. BEDE was the reverse of this 
construct and replaced amino acids 1-430 of DE with amino 
acids 1-363 (as set forth in SEQ ID NO:3) of BE by using 

10 Kpnl partial digest. DEBH was produced from DEBE-C by 
removing all BE sequences except the D domain by Kpnl 
digestion and inserting sequences encoding the intact E and 
F domains (amino acids 430-878) of the original DE after 
partial Kpnl digestion. VEH was produced by excision of 

15 the central region of DEBH, including the heterologous 
hinge domain, and inserted into VDE, replacing the 
analogous region. 

Example 11 

Construction of E Domain Chimeric Receptors 

20 The chimera, BEDB , was produced by insertion of the 

6 54 -bp KpnI-Bg/II fragment encoding precisely the DE 
hormone -binding domain into the identical analogous sites 
in BE. Two unique sites within the hormone -binding domain 
of BEDB were used in the production of the chimeras. A 

25 unique Aatll site lies approximately 12 0 bp downstream of 
the 5' Kpnl site, and a unique EagI sites lies 
approximately 440 bp downstream. PCR was used as described 
above, and corresponding fragments of BE with appropriate 
compatible ends were generated and subcloned into BEDB 

30 digested with Kpnl + Aa til (BKE) , Aatll + Eagl (BAE) , Aatll 
+ Bg/II (BAB), and Eagl-Bg/II for BEB. The resulting 
constructs were translated, examined by SDS/PAGE, 
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quantified, and normalized as described above. Conditions 
for gel mobility shift assay were also as described above. 

Example 12 

Infection of Cells with Recombinant Retroviruses 

5 Transient restroviral production in 293 cell types has 

been described {Pear et al . ((1993) PNAS 90:8392-8396). 
Forty-eight hours after transient retroviral production, 
the conditioned medium was removed, filtered through 
0.45-/xm filters, and frozen at - 70°C until use. 

10 Ten-centimeter dishes of primary- cultured Fischer rat 
abdominal fibroblasts (Schinstine et al . (1992) Neurochem 
58:2019-2029) and 293 and CV-1 cells at approximately 50% 
density were infected at a multiplicity of infection of 
approximately 0.05 with virus -containing media and 8 mg/ml 

15 Polybrene for 48 h. Infected cells were then selected with 
4-6 mM L-histidinol until noninfected cells were cleared 
from the population: approximately 10 d for FF12s, 21 
dafor 293s, and >30 d for CV-ls (CV-l cell populations were 
resistant to rigorous selection with l . -histidinol) . 

20 Resistant colonies from multiple plates were trypsinized, 
pooled, and passaged. The resulting bulk population was 
then split and infected with receptor-bearing virus and 
reselected again in medium containing both L-histidinol and 
400-800 fig/ml G418. The infection and selection process 

25 for the second round of viral infection proceeded as 
described for the first round. The final population of 
doubly resistant primary fibroblasts was then pooled and 

^ ^passage^A^ 

P-galactosidase staining by standard methods or luciferase 
30 assay. 
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Example 13 

CV-1 Cells Support BE-Mediated Transact ivat ion Without 
Exogenous Dimer Partner 

Both receptors were tested to determine the effect 
5 that differential endogenous dimer partner availability had 
on transactivation of responsive promoters. Relative 
lucif erase activity employing the reporter plasmid, E4-luc, 
were performed from VDE and VBR transiently cotransf ected 
into 293 or CV-l cells with either Usp, RXR, or no added 
10 heterodimer partner. Lucif erase activity was assayed in 
the presence of either vehicle, 1 /xM murA, or 1 juM 
tebuf enozide . 

In the presence of an equimolar exogenously added 
dimer partner, VDE and VBR transactivation was similar in 

15 both cell types. With Usp, both proteins were induced less 
than 2- fold by ligand and displayed a high level of basal 
transactivation. With RXR, VDE displayed an average 
relative 5.35-fold induction across both cell types, 
whereas VBR was induced only 2.35-fold, even though the 

2 0 absolute level of induction matched or exceeded the 
expression with VDE. The decreased relative induction of 
VBR + RXR resulted from approximately doubled basal 
activity levels compared with VDE + RXR. With no exogenous 
dimer partner, both receptors in both cell types exhibited 

25 dramatic 15- to 80-fold decreases in basal transactivation. 



to tebuf enozide , whereas VBR continued to respond well to 
both murA and tebuf enozide. The addition of RXR to 
VDE- transf ected 293 cells increased the maximum 
3 0 murA- treated expression level by only 20%, indicating that 
the high level of endogenous RXR in 293 cells is near 
saturation for heterodimerization with DE. By comparison, 
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CV-1 cells supported murA stimulation of VDE at only 13% of 
the expression level of VDE with added RXR (Fig. IS) , VBR 
was active at levels similar to VDE in the high RXR 
background of the 293 cells; however, in the CV-1 cells, 
5 tebuf enozide- stimulated VBR exhibited both the highest 
absolute level of transact ivat ion and the greatest relative 
induction (160.2-fold) of any of the other combinations of 
receptors and ligands tested. In the CV-1 cells, 
tebuf enozide- stimulated VBR displayed 21-fold greater 
10 relative induction and an absolute expression level 9.25 
times the level of VDE treated with mutA. 

Example 14 

BE Displays Facilitated Heterodimer Formation with Usp 

and RXR 

15 The high level function of VBR in the low heterodimer 

partner environment of the CV-1 cells suggested that BE 
might be capable of efficient function with lower levels of 
dimer partner than DE. To explore this possibility, 
cell -free experiments with gel mobility shift analysis and 

2 0 equivalent amount of in vitro translated BE and DE in 

combination with Usp or RXR dimer partners and both ligands 
were performed. Neither protein alone formed a dimeric 
complex with the EcRE probe unless either the Usp or RXR 
dimer partner proteins were added. In the presence of Usp, 
25 DE formed a comparatively weak interaction with the EcREs 
that was increased approximately 4-fold by murA and 2- fold 
by tebuf enozide . In contrast, BE produced a strong shift 
-*r. ~ - *-^^.^^*wj&fo^> -Usp*- r-tha ~ was ^■©nly.^- s lightrly , -^inf 1 uenced^by^ei feher>^. 

ligand. In the absence of any ligand, BE bound probe 5 

3 0 times more efficiently than DE. With ligand, the BE probe 

shift was observed to still exceed the DE shift and was 
double the DE + Usp band volume. Both receptors combined 
with RXR displayed an absolute dependence on ligand for 
heterodimer formation; however, whereas BE + RXR with 
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either ligand displayed a prominent shifted band, an 
equivalent amount of DE resulted in a barely detectable 
shifted band with murA and no detectable band shift with 
tebuf enozide . The BE + RXR tebuf enozide- and murA-induced 
5 shifts were determined to be >15 times the DE + RXR shift 
with murA. 

Example 15 

BE-DE Chimeras Reveal That the D Domain Is Critical to 
High Affinity Heterodimerization 

10 Gel mobility shift assays indicated that BE had higher 

affinity heterodimer formation than DE. To further define 
the subdomains and molecular determinants resulting in the 
BE high affinity phenotype, PCR mutagenesis and internal 
shared restriction endonuclease sites were used to produce 

15 BE-DE chimeric receptors that were assayed for their 
ability to bind the EcRE probe with either Usp or RXR and 
with or without ligand. Because previous results indicated 
that murA stimulated both EcRs, for purposes of direct 
comparison only murA was used for gel shift analysis of the 

20 chimeras. Fusion constructs began at the N terminus of BE 
and sequentially replaced the A/B (DEBE-A) , C (DEBE-C) , 
and D (DEBE-D) domains with DE sequences. BEDE was the 
reverse, encoding BE up to the D-E domain boundary with the 
DE E and F domains. DEBE-A and DEBE-C with Usp shifted 

25 nearly as efficiently as native BE. The absence of* shift 
for DEBE-D, however, and the substantial shift seen for 
BEDE + Usp suggested that the high affinity determinant for 

domain. The same constructs in combination with RXR 
30 revealed a different shift pattern. BE and DEBE-A both 
responded strongly to ligand to bind and shift the probe. 
DEBE-C with ligand was approximately 40% decreased relative 
to native BE; however, it still displayed 16 -fold greater 
binding than native DE. DEBE-D had lost all high affinity 
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DNA-binding complex formation and functioned 
indistinguishably from the native DE with or without 
ligand. MurA-stimulated BEDE + RXR produced a band shift 
4 times the intensity of the native DE + RXR stimulated 
5 with murA, but BEDE was clearly significantly impaired to 
heterodimerization with RXR compared with native BE and 
other higher affinity chimeras like DEBE-A and DEBE-C. 

For further confirmation of the role of the BE D 
domain in dimer partner affinity, the D region of DE was 

10 replaced with the BE D domain to produce DEBH. Native DE 
and DEBH were compared side by side for binding to EcREs 
with both Usp and RXR dimer partners. Gel mobility shift 
of in vitro translated and normalized DE and 
hinge -substituted DEBH was performed, in the presence of 

15 either Usp or RXR, and further in the presence of vehicle 
or 1 /xM murA. Compared with DE + Usp, DEBH + Usp averaged 
5-fold greater probe binding with murA and 9- fold greater 
binding without ligand. DEBH + RXR exhibited both 
decreased probe shift in the absence of ligand and 

2 0 increased probe binding with murA (approximately 3- fold 
over native DE) for a 10- fold relative induction by ligand 
compared with 2. 5- fold for DE. Transient transfection 

analysis of a VP16-DEBH construct (VEH) in CV-1 cells 
revealed that VEH shared characteristics of both DE and BE. 

25 VEH displayed the high basal transactivation level and 

low relative induction (average of 2.5 fold) characteristic 
of BE with superphysiological RXR. VEH displayed 5- to 

native DE in the absence of any exogenous dimer partner. 
30 MurA-stimulated VEH expression exceeded even the expression 
level for murA- stimulated BE by nearly 2 -fold. As 
predicted, without the BE hormone -binding domain, DEBH did 
not to respond to tebuf enozide . Additional chimeric 
proteins with a subset for BED-region sequences did not 
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function as well as DEBH with replacement of the entire D 
domain, suggesting that multiple discrete determinants 
presumably over much of the hinge region contribute to the 
high affinity phenotype. 

5 Example 16 

Discrete Functional Regions Within the E Domain Revealed 

by BE -DE Chimeras 

Although the preceding data reveal that the BED domain 
alone was sufficient for high affinity heterodimerization 

10 with Usp, a comparison of the relative shift of the DEBE-D 
and BEDE chimeras with both partners suggested that 
additional determinants within the BE hormone -binding E 
domain contributed to a high affinity interaction with RXR. 
To further study the BE E domain, chimeric proteins were 

15 produced by using BEDE as a template. Five E-domain 
chimeras named BEDB , BKE, BAE, BAB , and BEB were prepared. 
BEDB is identical with BE with the exception of complete 
replacement of the E domain with corresponding sequences 
from DE, BEDB was identical with BEDE with the exception of 

20 replacement of the large C- terminal DE F domain for the 
20-amino acid F domain of BE. BKE, BAE, BAB, and BEB are 
derived from BEDB and are sequential, C- terminal moving 
replacements of three regions within the DE-derived E 
domain. Gel mobility shift assays with BE, BEDE, and BEDB 

25 revealed that BEDE and BEDB are essentially identical in 
their properties of complex formation, suggesting that the 
F domain does not play a significant role in 

^ heterj^imerdz^t^ 

recapitulates the high affinity USP binding and the low 

3 0 affinity RXR binding of BEDE. 

The BEDB-derived chimeric receptors reveal several 
functional subdomains over most of the BE E domain. By 
using unique internal sites, the E domain was subdivided 
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into approximately thirds called Ej E 2 and E 3 . Treatments 
were with vehicle (-) , 1 /iM murA (M) , or 1 }M tebufenozide 
(T) . Chimera BKE with replacement of the DE E x and E 2 , 
regions and chimera BAE with replacement of only the E 2 
5 region displayed very similar patterns of shift. Both 
chimeras were significantly impaired in complex formation 
with Usp and RXR relative to the original BEDB chimera, 
suggesting that there were fundamental incompatibilities 
between subdomains of the Bombyx and Drosophila E domains . 

10 The similarity of these two constructs suggested that DE 
and BE E x regions were likely to be functionally similar, 
but that the E 2 regions had very different properties. 
Curiously, although both basal and murA- stimulated 
heterodimerization were inhibited, BKE /BAE -Usp 

15 heterodimerization was significantly stimulated by 
tebufenozide, indicating that the tebufenozide response 
determinant is within E 2 but that the high affinity RXR- 
binding determinant lies elsewhere. 

When E 2 was combined in tandem with E 3 in the chimeric 

2 0 receptor BAB, high affinity heterodimer formation 

indistinguishable from native BE heterodimerization was 
observed with both Usp and RXR. To determine whether the 
E 3 region alone could confer high affinity 
heterodimerization with RXR, the chimera BEB , with 
25 substitution of only the E 3 region, was produced and tested. 
BEB displayed a pattern of response essentially identical 
with the original BEDB with low-level interaction with RXR 
and only upon stimulation with murA. BEB, which lacked the 

3 0 Taken together, these results indicate that the BE D domain 

and E 3 region, in combination with ligand-binding 
determinants within the E 2 region, function in concert to 
produce high affinity, ligand- dependent heterodimerization 
between BE and the RXR protein. 
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Example 17 

Comparison of VBR and VDE Function in Stably Transduced 

Cells 

Building on transient transfection and gel shift data, 
5 DE an BE variants were directly compared for their ability 
to function at low or single copy in stably transduced cell 
types. In addition to direct comparison of CVDE and CVBR 
retroviruses, an RXR-bearing VDE variant called CVDEiR was 
also tested that previous studies had determined to 

10 provide the best combination of low basal expression and 
relatively high induction for VDE cotransf erred with 
supplemental RXR. These three receptor-bearing vectors and 
an empty LNCX control were used to infect 293, CV-1, and 
primary fibroblast cells harboring an EcRE- responsive 

15 retroviruses termed MS. MS encoded a 3' LTR with the core 
enhancer elements replaced with four tandem EcREs and an 
internal simian virus 40 promoter vector directing 
resistance to the drug L-histidinol . Parallel MS vectors 
encoding either the LacZ gene (MS-Z) or a lucif erase 

20 transgene (MS-luc) were used in this comparative study, the 
former as an indication of the number of responding cells 
and the latter for quantitative purposes. 

Three cell types, 293s, CV-Is, and rat primary 
fibroblasts (line FF12) , were selected for assay of 
25 reporter/receptor virus combinations based on their 
relative capacities to supports high level DE-mediated 
transactivation. The human 293 cell line support 

added RXR, whereas CV-ls require the contransf ection of 
30 exogenous dimer partner fo^r DE function. FF12s were 
selected as the third cell type because they are refractory 
to chemical methods of transduction and reliant on the use 
of biological vectors such as recombinant retroviruses for 
efficient gene transfer. Furthermore, primary fibroblasts 
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are frequently used as autologous donor cells for 
transplantation approaches to somatic gene therapy. 



All three cell types were infected with both the 
reporter and receptor-bearing vectors and treated for 72 h 
5 with either vehicle, 1 yCtA murA or 1 /iM tebuf enozide . 
Histochemical staining of MS-Z infected cells indicated 
that all three receptor types were capable of high level 
ligand-responsive transact ivat ion in coinfected 293 cells. 
DE-derived constructs responded well to murA but were 

10 unable to respond strongly to tebufenozide. CVBR, on the 
other hand, responded moderately to murA but strongly to 
tebuf enozide . MS-LacZ cell types infected with an empty 
LNCX vector displayed no staining when stimulated with any 
ligand and further confirmed observations from 

15 vehicle -treated cells that the MS vector permitted only 
relatively low basal levels of transgene expression. 
Approximately 30% of CVDE-, CV-DEiR- , and CVBR- infected 
MS-Z 293 cells responded to murA. Tebufenozide stimulated 
<1% of CVDE-infected cells and <4% of CVDEiR- infected MS-Z 

20 293s. More than 50% of CVBR-inf ected MS-Z 293s responded 
to tebufenozide, however, exceeding all DE-based vectors 
with murA. More dramatic differences were seen in CV-1 and 
FF12 MS-Z infected cells. CVDE alone was not inducible in 
either cell type, indicating that the endogenous level of 

25 RXR in either CV-ls or primary fibroblasts is not 
sufficient to support VDE-mediated transactivation. When 
the endogenous level of RXR is supplemented, as in CVDEiR, 
3-10% of the infected population was observed to strongly 

30 potent transactivator in the CV-1 and FF12 cell types with 
20% of CV-1 and 55% of CVBR- infected FF12 cells 
histochemically positive. 
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Luciferase activity in MS-Luc cells infected in 
parallel with the same receptor viruses confirmed that CVBR 
was not the most potent transactivator in the CV-1 and FF12 
cell types and that all receptor types were functional in 
5 293 cells. The MS-Z data suggest that the low absolute 
level of CVDEiR induction in CV-l and FF12 cells with murA 
most likely reflects a small number of robustly responding 
cells in a largely nonresponsive population as opposed to 
low level induction by most MS- luc- infected cells. 

10 Example 18 

Transient transfections 

Transient transfections are performed by calcium- 
phosphate coprecipitation employing standard methods (see 
Sambrook et al . , in Molecular Cloning : A Laboratory Manual . 

15 Cold Spring Harbor Press, New York, New York (1989)). All 
tissue culture experiments are performed using DMEM 10% FBS 
in a 10% C0 2 incubator unless otherwise specified. All 
transfections are performed in triplicate in 24 -well Costar 
plates using CV-1 cells at an approximately 5X10 4 plating 

20 density. Immediately following transf ection, 1 /xm ligand 
in 20% EtOH/PBS is added to wells. After 40 additional 
hours of incubation, cells are harvested and luciferase 
activity measured in an analytical bioluminescence 
photometer. Cell extracts are simultaneously examined for 

25 LacZ activity of the internal control by standard methods. 
Bar graph levels represent relative luciferase activity 
after correction using internal control values. All 

are provided by either NEB (Beverly, MA) or Strategene (La 
3 0 Jolla, CA) . 



WO 99/02683 



PCT/US98/14215 



Prolonged transqene expression for 
ex vivo gene therapy applications 

Autologous, explanted skin fibroblasts genetically 
5 modified to express tyrosine hydroxylase (TH) , the enzyme 
responsible for synthesis of L-dopa and the precursor for 
the neurotransmitter dopamine, have proven to ameliorate a 
loss of local dopamine in animal models of Parkinson's 
disease. Although quite effective in providing dopamine to 

10 the area of neural damage, this technique is only 
therapeutically useful for 2-3 weeks following 
transplantation, The decrease in effectiveness has been 
traced back to a dramatic loss of TH transgene expression 
in transplanted, post -mitotic cells. The use of invention 

15 constructs allows one to overcome the loss of transgene 
expression by providing stimulation of the retroviral LTR 
promoter through either ligand-activated transact ivating 
complexes or through constitutive transact ivating receptor 
variants. In this way, transgene expression may be 

20 maintained for longer periods of time, even indefinitely if 
desired. 



Stem- type cells, such as stem cells of the 
hematopoetic system, nervous system, or embryo, could be 
infected or transfected with VbR regulated transgenes and 

25 subsequently implanted into adult, fetus, or early 
embryonic animals for either therapeutic or research 
purposes. Cells of the hematopoetic system could 
, ™ ..... ^corxdi,tionally^express- ^prote ins--.. producing- -±>lood^.c-ljOfet ing<= 
factors such as factor IX, metabolic factors such as 

30 glucocerebrosidase, or protective factors including 
anti-HIV proteins. 
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Example 20 
Prolonged transgene expression for 
in vivo gene therapy applications 

The small size and regulatory capacity of VbRs lend 
5 themselves to use in recombinant retroviruses as a method 
of gene transfer. VbRs have been introduced into both MLV 
and lent i viral -based retroviral systems. When these 
viruses are introduced directly into target cells of either 
a mature or developing organism, expression of the virally 
10 encoded transgenes may be regulated by systemic addition of 
ligands such as tebufenozide and derivatives. An example 
of a disease that could be theoretically ameliorated by 
application of in vivo VbR encoding retroviruses is 
Parkinson's disease, described above. 



15 Another application would be to use VbR encoding 

retroviruses as an anti-viral agent. Lentiviral vectors 
with regulated properties and harboring suicide genes or 
"protective" proteins could be used as a means of 
conditionally depleting or destroying HIV-positive cells. 



20 



Example 21 

Modulated transgene expression for either 
in vivo or ex vivo gene therapy application 



Treatment of Parkinson's disease with the chemical 
precursor of dopamine, L-dopa, has proven effective in 
. .^2€^ameliorat -itig- * -many- - © f^fche^defd e i- 1«— *of -~Pa2?k4nsoni em ^ -«Wd-fe h- 
time, however, patients become refractory to L-dopa 
therapy, with the deleterious effects of chronic treatment 
outweighing even the serious symptoms of the disease 
itself. Eventually, patients are left with few therapeutic 
30 options. While the transplantation of TH expressing cells 
may be effective when constantly producing low-levels of L- 
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dopa, a potentially far more beneficial approach would be 
to allow the physician some degree of control over L-dopa 
production in the patient. This would allow sufficient 
control to ensure that the transgenic factor is expressed 
5 at appropriate therapeutic levels. At times when 
endogenous systems are capable of providing full function, 
the transgene may be allowed to become quiescent and 
transcriptionally inactive until needed again. Because the 
transcriptional induction of the invention retroviral 
10 constructs is dependent on an exogenous ligand, expression 
of an integrated therapeutic transgene can be placed under 
the control of the physician and patient. 

Example 22 

Construction of a BORIS vector expressing PolyO-GFP 

15 PolyQ-GFP are constructed from an existing GFP clone, 

GFP-SKSP. PCR primers encoding a novel restriction site 
and ATG start codon at the 5' end, and an Age I site 
in-frame with the GFP ORF at the 3' end are obtained for 
PCR amplification of huntingtin polyQ regions. These PCR 

2 0 products are inserted in- frame into GFP-SKSP to produce the 
fusion constructs of pQ-GFP. The clones are analyzed for 
mutations and function by sequencing and in vitro 
translation to confirm the creation of a pQ-GFP ORF. 
Alternate antibiotic resistance BORIS regulatable 

25 retroviral vectors are produced by removing the 
L-Histidinol gene from plasmid LSHL, by PCR, and inserting 
it in place on the neo r into a BORIS vector for production 

; -t-t*:** o£ < -BORIS -XiHia-, .« , .Tha^pl .asrai ci^are^ .large * seal e ,.pr eppe&> - JU, 
preliminary test for function is transient transfection 

30 experiments into transf ectable cell types such as 293 cells 
and examination for the expression of beta-galactosidase or 
GFP reporter genes. Retrovirus are transiently produced 
and used for infection of target cells. Infected cells are 
selected in L-Histidinol containing media to ensure proper 
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selection. polyQ-GFP is then inserted into regulated 
retroviral vectors. pQ-GFPs are excised from SKSP and 
inserted into the regulatable BORIS and BORIS-LHis vectors. 
The plasmids are large-scale prepped. . Function is 
5 determined by examination for green fluorescence and 
altered phenotype . Additional plasmid are used for 
transient production of retroviruses. 

Primary rat fibroblasts, and the hcnlv-myc are selected for 
initial testing of BPQGS. Both are infected with the 

10 appropriate vector is) place under selection, and individual 
resulting colonies subcloned. The subclones are expanded 
and passaged for analysis. Infected cells are treated or 
not with their respective ligands and the resulting 
phenotypes analyzed. Preliminary analysis will examine GFP 

15 fluorescence and distribution within the cells. At timed 
intervals following ligand induced expression, the pattern 
of fluorescence are recorded to determine if the fusion 
protein changes in distribution. Morphological 
characteristics of stimulated cells their growth rates, and 

20 cell death will also be examined at intervals following 
induction. For the hcn/v-myc cells, the consequences of 
polyQ-GFP expression in the presence and absence of 
tetracycline are compared. 

Example 23 

25 Aggregation of polvO- fusion proteins 

GFP fusion proteins with long (97Q) , short (13Q) , or 

restriction endonuclease sites were designed into the 
regions flanking the polyQ tracts to allow insertion of an 
30 SV40 nuclear localization signal (NLS) or other sequences 
either C-terminal to the polyQ tract. Transient 
transfection analysis of these constructs in the NIT 
expression vector revealed that cells transduced with GFP 
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10 



only or 13Q-GFP expressed GFP uniformly throughout their 
cytoplasm and that the NLS variant 13QN-GFP localized 
predominantly in the nucleus with a uniform distribution. 
97Q- and 97QN-GFP constructs, on the other hand, initially 
manifested as uniform GFP positive fluorescence throughout 
the cytoplasm or nucleus, but within 24-72 hours after 
transfection condensed into bright granules indicative of 
intracellular aggregates (IAs) . IAs were localized within 
the cytoplasm, surrounding the nuclear envelope, and within 
the nucleus itself (Figure IB) . The presence of the SV40 
nuclear localization signal (NLS) , though observed to 
efficiently direct localization of either 13Q- or 97Q-GFP 
variants to the nucleus, resulted in a quantifiable 
difference in the rate of IA formation after transfection. 



15 Example 24 

GFP -fusion proteins with short polyglut amine tracts are 

recruited into IAs 

It was sought to determine whether overexpression of 
13Q-GFP constructs would inhibit or interfere with the 

20 formation of IAs from 97Q- fusion constructs. 13Q-GFP was 
not observed to exert any inhibitory influence on 97Q-GFP 
mediated IA formation, and contrary to expectations, 
appeared to result in a qualitative difference in the IAs 
that formed. To test whether the 13Q-GFP fusion proteins 

25 were participating in the formation of IAs, the 
GFP-encoding portion of 97Q-GFP (97QA) was deleted and this 
construct was cotransf ected with 13Q-GFP into cultured 

large population of the GFP-positive cells within the 24-72 
30 hr. time course observed for 97Q-GFP alone. 13Q-GFP/97QA 
doubly infected cells were distinctive from 97Q-GFP cells 
because they continued to display faint uniform cytoplasmic 
GFP positivity in addition to the formation of 
predominantly nuclear IAs, whereas IA formation within 
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97Q-GFP cells generally results in condensation of all GFP 
fluorescence into IAs leaving other areas of the cell 
nonf luorescent and dark. Control cotransf ection of 
NIT-97QA with NIT-GFP constructs with no N- terminal 
5 polyglutamine tracts did not result in the formation of IAs 
under any conditions tested, indicating that the short 
polyQ tract of 13Q-GFP is necessary for participation in IA 
formation . 

Preliminary studies of 13Q-GFP recruitment were 
10 performed with a molar excess of 97QA plasmid. To further 
determine the molar ratio of 97Q to 13Q required for 
participation of 13Q in IA formation, 13Q-GFP was titrated 
with 97QA in a cotransf ection assay and the percent of IA 
containing cells quantified relative to the total number of 
15 visibly GFP positive cells. No IAs were observed with non 
cotransf ected 97QA or at a 0.1 molar ratio of 97QA plasmid; 
however, at a 0.5 molar ratio of 97QA to 13Q-GFP, 10V of 
GFP positive cells contained GFP positive IAs. At a 1 : 1 
ratio, nearly 40% of GFP positive cells also displayed IAs. 
20 At 10:1 ratio of NIT-97QA to NIT-13Q-GFP, over 75% of 
positive cells displayed IAs 60 hours post -transf ection. 

Example 25 

Characterization of IAs formed from a combination of long 
and short polvO-reporters 

25 To better characterize the formation of IAs in the 

presence of both long and short polyQ tracts, 97QA was 
~^~„ -mc^i^ied^^ 

encoding (3-galactosidase (97QZ) . Preliminary transient 
transfection studies of 97QZ revealed that like 97Q-GFP, 
30 97QZ condensed within transfected cells to form both 
immunopositive and histochemically reactive IAs. NIT-97QZ 
and NIT-13Q-GFP were cotransf ected into cells and 60 hours 
later, performed immunohistochemistry and histochemical 
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reactions on the transfected cells to simultaneously 
examine both reporter constructs. Although the 

(3-galactosidase ((3-gal) histochemical reaction obscures GFP 
fluorescence, through slight modification of standard 
5 staining techniques and limiting the time of exposure to 
reagents, a low level of GFP fluorescence could be 
preserved and imaged along with the characteristic blue 
stain of the (3-galactosidase reaction. A population of IAs 
are observed that are both GFP and p-galactosidase 
10 positive. Numerous large P-gal only positive IAs are also 
present, presumably from the aggregation of 97QZ only (or 
predominantly) . Curiously, several IAs that appear only 
positive for GFP were also observed using the histochemical 
stain and native GFP fluorescence imaging. 



15 Further exploration of this observation was performed 

using confocal microscopy and immunohistochemistry to 
identify reporters within IAs. As in the above study, 
individual IAs were clearly observed to be positive for 
both reporters while other IAs were positive only for (3-gal 

20 for GFP. Taken together, these results clearly indicate 
that long polyglutamine tract -reporter fusion constructs 
form IAs in vitro, and that the forming aggregates can 
either recruit short polyglutamine -reporters into the 
aggregate, induce them to aggregate on their own, or both. 

25 Example 26 

Construction of cell lines with regulated IA formation 
and 970-GFP expression 

The study of IA formation has thus far relied heavily 
on transient transfection analysis and has been hampered by 
30 a lack of stable IA-forming cultured cell populations. 
Through the use of retroviral vectors, a variety of stably 
cultured cell lines conditionally or chronically expressing 
the polyQ-GFP constructs were constructed. Two regulatable 
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vector systems, an induced expression system based on a 
modified ecdysone receptor variant (VbR) and a repressed 
expression system based on the tetracycline transactivator 
(TTA) , were used to create cell lines with inducible IA 
5 formation. These two vectors are best suited to different 
ranges of induction in cultured cell types. VBR-based B3 
vectors are particularly suited to applications where very 
low basal expression levels that can be rapidly stimulated 
to moderately high levels are desired. The TTA-based NIT 

10 vector tends not have the broad-based low level of 
"uninduced" expression characteristic of the BORIS vectors 
(identified in SEQ ID NO:6); however, because 
transactivator expression and transactivation is continuous 
in the absence of ligand, very high transgene expression 

15 levels may be achieved through the use of this vector. 

97Q-GFP was cloned via a custom shuttle vector into 
both retroviral vectors. Transiently produced B3-polyQ-GFP 
retroviruses (B3-13Q or B3-97Q) and TTA-based NIT polyQ-GFP 
retroviruses (NIT-13Q or NIT-97Q) were used at low MOI to 
20 infect several well-characterized cell lines used 
previously in transduction studies of IA formation. COS-7 
cells were selected because several studies had examined IA 
formation and IA- induced apoptosis in this cell type. 
293 cells were utilized for the same reason and also 

2 5 because they are amenable to efficient transfection and 

would have significant utility in the development of 
gene-based factors that interact with or interrupt IA 
formation. Primary rat fibroblasts were selected because 
~^^fc-hey-^^-^ readily- ' ; ente^^.a.-,-pr_QXongeci 

3 0 post -mitotic state following contact inhibition at high 

culture densities. 

Pilot studies with reporter constructs revealed that 
the B3 vector functioned most efficiently in the 293 and 
COS cell lines and that NIT functioned at highest 



WO 99/02683 



PCT/US98/14215 



82 

efficiency in the primary fibroblast line. Individual cell 
lines infected with the appropriate vector were placed 
under G418 selection to remove uninfected cells. For 
B3-polyQ infected 293 and COS-7 cells, the resulting 
5 resistant population was pooled and passaged without 
ligand. For NIT-polyQ infected primary fibroblasts, cells 
were cultured with the tetracycline analog doxycycline 
prior to, during, and after infection to prevent chronic 
polyQ overexpression to disrupt growth of resistant 
10 colonies. 293 and COS-7 cells were also infected with NIT 
constructs and cultured without added doxycycline to 
determine the effects of chronic overexpression of 97Q-GFP 
on these cell types as described below. 

Example 27 

15 Stable IA-forming 293 cells 

G418-resistant B3-97Q 293s displayed little to no 
visible GFP fluorescence during culture in the absence of 
ligand; however, introduction of 1 /xM tebufenozide into the 
culture medium resulted in a rapid increase of cytoplasmic 

2 0 fluorescence within 24 hours post -induct ion. Nearly 10 0% 

of cells would display GFP-positive fluoresence within 
24-72 hours of induction. After 24 hours, only widely 
scattered cells would contain small, detectable IAs . By 
48 hours, >8% of the cell population would contain one or 

25 more IAs increasing to >15% of the population between 48 
and 72 hours. Occasional IA containing cells would be 
observed floating in the culture medium; however, there was 

,^^^K>^evi^ent . " 

or later times.. Passage of B3-97Q 293s in the presence of 

3 0 ligand for three passages resulted in only a small but 

detectable decrease in the percentage of cells containing 
IAs (15% to 11%) . 
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NIT-97Q infection sheds further light on the toxic 
potential of long-polyglutamine repeats on 293 cells. 
Surprisingly, colonies of 100% IA-positive 2 93 cells 
survived for over two weeks and apparently divided and 
5 grew, in some cases, to colonies of hundreds of cells 
(Figure 4D) . By the end of 21 days of culture; however, 
the majority of IA positive colonies had disappeared 
leaving only scattered IA positive cells and numerous 
colonies of GFP positive cells without IAs that eventually 
10 grew to fill the plates. The expression time-course and 
level of B3- AND NIT-13Q cells paralleled the 97Q cells 
although no IAs were formed. 

Example 28 
Stable IA-formina CQS-7 cells 

15 G418- resistant B3-97Q COS- 7s displayed no visible GFP 

fluoresence during culture in the absence of ligand in a 
manner almost identical to the 2 93 B3-97Q population. 
Addition of 1 /zM tebufenozide into the culture medium 
resulted in a rapid increase of cytoplasmic fluorescence 

20 within 24-48 hours post- induction , IA formation peaked at 
4-5% of total cells (at any given moment) 72-96 hours 
post-induction. Unlike the B3-97Q 293 cells, a continuous 
population of G418-resistant cells did not display even GFP 
fluoresence suggesting that one or more components of the 

25 regulatable system were lost from the COS population. Like 
B3-97Q 293 cells, IA containing COS-7 cells were also 
observed floating in the culture medium; however a 

^v^^ep.leti^n^.^ 

presumably because the continuous formation of IAs in other 

30 cells. 

The effects of chronic 97Q-GFP overexpression in 
infected COS-7 cells was examined using the NIT vector . 
Like NIT-97Q infected 293s, colonies in which the majority 
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of cells contained large and numerous IAs were readily- 
observed. Unlike the overexpressing 293 cells; however, 
several selected colonies continued to grow and express IAs 
in a continual percentage of the population even after 
5 passage. Three lines, COSN97Q-4, 6 and 7, continually 
express IAs after continuous growth for over four weeks . 
COSN97Q-7 displays the highest level of spontaneous IA 
formation. At any given time, approximately 6% of the 
total population contains readily observable IAs . CV~1 
10 cells, a predecessor of the COS- 7 cells were also infected 
with NIT-97Q and formed IA containing colonies in much the 
same manner as COS cell populations . 

Example 29 
Stable IA forming primary fibroblasts 

15 B3-97Q infected primary fibroblasts (FF12s) became 

faintly GFP fluorescent within 72 hours of tebufenozide 
treatment but did not achieve a maximum level of 97Q-GFP 
expression sufficient to result in IA formation in a 
significant population of cells. The NIT vectors were 
20 employed to generate regulated 97Q-GFP expression. 
Infected NIT-97Q FF12s were infected and cultured 
continuously in the presence of doxycycline (dox) to 
prevent IA formation. Twelve individual colonies of 
NIT-97Q FF12s were selected in order to screen for 
25 individual cell populations with low dox inhibited 
expression and high 97Q expression in the absence of 
ligand. One line, FFN97Q-5 displayed GFP fluoresence 
,.wj. t iiig pf^t^ r ^cgjifluent T eel 1 ^popu 1 a t ^ri_ ^fj^i^^^^SX^^ 
of ligand removal. IAs began forming on day 5 and 
30 continued to increase through day 8. Approximately 15% of 
the total cell population displayed IAs 10 days post ligand 
removal. IA formation in the confluent FFN97Q-5 population 
was distinctive from other cell types examined in that it 
was occasionally exclusively nuclear and at other times 
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predominantly cytoplasmic in an unusual pattern within the 
cell body. IAs with a distinctive stellate or tri-partite 
appearance were often observed. Large cytoplasmic 
aggregates of multiple star- like IAs were also occasionally 
5 seen. Curiously, confluent FFN97Q-5 cells with multiple 
cytoplasmic or nuclear IAs appeared visually normal. 
Floating dead cells with or without IAs were rarely 
observed and in numbers no different from control wells. 
FF970Q-%s cultured in the presence of dox neither expressed 
10 visible 97Q-GFP nor formed IAs. 



Example 30 
Use of invention for efficient 
production of transgenic animals 



Transgenic animals are generally produced by either 

15 pronuclear injection of DNA or by transfection of embryonic 
stem (ES) cells followed by selection and injection of the 
stem cell into the inner cell mass of very early embryos. 
Pronuclear injection results in approximately 5-10% stable 
gene transfer in the production of transgenic mice. The 

20 use of ES cells in producing transgenics is likewise 
inefficient in generating mosaics with germ-line 
transmission of the transgene. It was proposed in the mid- 
1980' s to use retroviruses to transfer transgenes with high 
efficiency into early embryos or ES cells to dramatically 

25 enhance the odds of producing transgenic animals. All 
attempts at this failed, not because the virus was 
incapable of stably integrating into the target cell 

«~ ^genc-rae^kut^ 

any of the genes encoded within the viral transcriptional 

30 cassette. 



The present invention is capable of overcoming the 
transcriptional block to result in germ-line transgenic 
animals with full expression from the integrated transgene. 
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In addition to expressing the transgene, the level of 
transcription may still be regulated by controlling the 
supply of ligand to the transgenic animal. The increased 
efficiency of producing transgenic animals by retroviral 
5 infection should open up the way to producing mutant 
animals of a variety of species previously impractical for 
genetic modification because of the potential cost of 
producing a large number of non-positive animals by 
classical methods. 

10 To produce transgenic mice, the following nucleic acid 

constructs are prepared and subsequently injected into 
fertilized eggs: CD3-VbR and a ligand inducible S-gal 
reporter. Two separate lines of transgenic mice are 
generated harboring either a ligand inducible reporter, or 

15 a T-cell specific expression construct of VbR, 
respectively. The former are referred to as reporter mice, 
the latter are referred to as receptor mice, and double 
transgenic mice are referred to as receptor/reporter mice. 
Constructs CD3-VbR are injected, while the reporter is 

20 injected alone. Primary genotyping is performed by 
Southern blot analysis and the transmission of transgenic 
mice is monitored by dot blot analysis. Receptor mice are 
analyzed for VbR expression by Northern blot analysis of 
RNA collected from these mice. For Northern blot analysis, 

2 5 15/^g of total RNA obtained from the thymus, and various 
tissues as a control, is run on a denaturing gel and 
blotted onto a nitrocellulose membrane. The blot is probed 
with a radiolabeled S-gal-specif ic probe and exposed on 

30 transferred to the offspring as expected by Mendeiian 
genetics . 



WO 99/02683 



PCT/US98/14215 



87 

Example 31 

Use of retroviral constructs in the invention for 
efficient gene transfer to developing embryos 

Since the present invention can effectively overcome 
5 the block of viral expression in embryonic cells, invention 
constructs are a potent tool in the delivery of transgenes 
to somatic tissues of a developing embryo. With many 
diseases, considerable damage is done during embryonic 
development so that therapies applied after birth are 
10 essentially ineffective to ameliorate the disease 
phenotype . 



The present invention provides methodology where one 
can infect cells of the embryo and can provide therapeutic 
factors to the developing fetus either constitutively, or 
15 under the regulation of exogenously produced ligand. 



Example 32 
Use of vector constructs in invention 
with inducible high titers 



One obstacle to the use of retroviruses as gene 
20 transfer agents is that titers of retroviruses from 
existing producer cell lines are only on the order of 1X1 0 4 
or 1X10 5 . By using a retroviral construct of the invention 
having intact enhancers and regulatory elements, expression 
of the retrovirus may be induced by greater than ten- fold, 
25 resulting in correspondingly higher titers of infectious 

While the invention has been described in detail with 
reference to certain preferred embodiments thereof, it will 
be understood that modifications and variations are within 
30 the spirit and scope of that which is described and 
claimed. 
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1 . A nucleic acid construct encoding a receptor 
peptide comprising: 

(i) a ligand binding domain and hinge region of a 
non- mammalian member of the nuclear receptor 
superf amily 

(ii) an activation domain, and 

(iii) a DNA binding domain; 

wherein said receptor peptide activates regulatory 
element (s) in the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for said ligand 
binding domain. 

2. A nucleic acid construct according to claim 1 
wherein said ligand for said ligand binding domain is a 
non-steroidal compound not normally present in a 
mammalian cell. 

3. A nucleic acid construct according to claim 1, 
wherein said ligand binding domain and hinge region are 
derived from a lepidopteran species . 

4. A nucleic acid construct according to claim 1, 
wherein said ligand binding domain and hinge region are 
derived from bombyx mori. 

claitn_ 1_ ^ 

wherein said ligand binding domain has substantial 
sequence identity with amino acid sequence 393-508, as 
set forth in SEQ ID NO: 3. 

6 . A nucleic acid construct according to claim 1 
wherein said ligand binding domain has substantial 
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sequence identity with amino acid sequence 3 93-586, as 
set forth in SEQ ID NO: 3. 

7. A nucleic acid construct, according to claim 1 
wherein said hinge region has substantial sequence 
identity with amino acid sequence 273-362, as set forth 
in SEQ ID NO:3. 

8. A nucleic acid construct according to claim 1 
wherein said hinge region has substantial sequence 
identity with amino acid sequence 273-362, as set forth 
in SEQ ID NO: 3. 

9. A nucleic acid construct according to claim 1 
wherein said DNA binding domain is obtained from a DNA- 
binding protein. 

10. A nucleic acid construct according to claim 9 
wherein said DNA binding domain is obtained from a member 
of the nuclear receptor superfamily. 

11. A nucleic acid construct according to claim 10 
wherein said DNA binding domain is non -mammalian. 

12. A nucleic acid construct according to claim 1 
wherein said receptor peptide has substantial sequence 
identity with the amino acid sequence set forth in SEQ ID 

13. A nucleic acid construct according to claim 1, 
wherein said nucleic acid construct further comprises a 
regulatory element operative ly associated with exogenous 
nucleic acid(s) . 
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14 . A nucleic acid construct according to claim 13 
wherein said regulatory element comprises a hormone 
response element . 

15. A nucleic acid construct according to claim 14 
wherein said construct comprises a plurality of hormone 
response elements . 

16. A nucleic acid construct according to claim 13 
wherein said exogenous nucleic acid(s) is {are) selected 
from wild type and therapeutic nucleic acid(s) . 

17. A nucleic acid construct according to claim 16 
wherein said wild type nucleic acid(s) is (are) selected 
from genes which encode products: 

the substantial absence of which leads to the 
occurrence of a non-normal state in said host ; or 

a substantial excess of which leads to the 
occurrence of a non-normal state in said host. 

18. A nucleic acid construct according to claim 17, 
wherein said exogenous nucleic acid is a CAG repeat 
encoding a polyglutamine polypeptide. 

19. A nucleic acid construct according to claim 16 
wherein said therapeutic nucleic acid{s) is (are) 
selected from those which encode products: 

expressed; or 

which impart a beneficial property to said 

host . 

20. A nucleic acid construct according to claim 19 
wherein said therapeutic nucleic acid(s) modulates the 
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transcription and/or translation of endogenous nucleic 
acid(s) . 

21. A nucleic acid construct according to claim 1, 
further comprising nucleic acid encoding a selectable 
marker. 

22. A nucleic acid construct according to claim 21 
wherein said selectable marker is a radiolabeled 
molecule, a fluorescent molecule, an enzyme, or a ligand. 

23. A nucleic acid construct according to claim 22 
wherein said enzyme is an antibiotic resistance gene or a 
gene which enables cells to process a metabolic 
intermediary. 

24 . An expression cassette comprising a promoter 
operatively associated with a nucleic acid construct 
according to claim 1. 

25. An expression cassette according to claim 24, 
wherein said promoter is selected from inducible, 
constitutive and/or tissue specific promoters. 

26. A viral vector capable of ligand -mediated 
expression, wherein said vector comprises an expression 
cassette according to claim 24 . 

27. A viral vector according to claim 26, wherein 
said viral vector further comprises a retroviral psi W 
packaging signal and a 5' and/or a 3' long terminal 
repeat (LTR) . 



SUBSTITUTE SHEET (RULE 26) 



WO. 99/02683 



PCT/US98/14215 



92 

28. An animal cell in culture which is transformed 
with a nucleic acid construct according to claim 1 . 

29. A cell according to claim 28, wherein said cell 
is further transformed with a reporter vector comprising 
a regulatory element operatively associated with 
exogenous nucleic acid(s) . 

30. An animal cell in culture which is transformed 
with a nucleic acid construct according to claim 13. 

31. An animal cell according to claim 30, wherein 
said cell is an embryonic stem cell. 

32. A transgenic animal, wherein cells of said 
animal is transf ected with, a nucleic acid construct 
according to claim 1 . 

33. A transgenic animal, wherein cells of said 
animal is transfected with a nucleic acid construct 
according to claim 13. 

34 . A polypeptide receptor encoded by a nucleic 
acid construct according to claim 1. 

35. A nucleic acid construct encoding a receptor 
peptide comprising : 

(i) a ligand binding domain and hinge region of a 
non-mammalian member of the nuclear receptor 
super family, and 

(ii) a DNA binding domain, wherein said DNA binding 
domain is not derived from said non-mammalian 
member ; 

wherein said receptor peptide activates regulatory 
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element (s) in the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for said ligand 
binding domain. 

36. A nucleic acid construct according to claim 35, 
wherein said receptor peptide further comprises: 

(iii) an activation domain. 

37. A nucleic acid construct according to claim 35, 
wherein said nucleic acid construct further comprises a 
regulatory element operatively associated with exogenous 
nucleic acid(s) . 

38. An expression cassette comprising a promoter 
operatively associated with a nucleic acid construct 
according to claim 35. 

39. A viral vector capable of ligand-mediated 
expression, wherein said vector comprises an expression 
cassette according to claim 38. 

40. An animal cell in culture which is transformed 
with a nucleic acid construct according to claim 35. 

41. A cell according to claim 40, wherein said cell 
is further transformed with a regulatory element 
operatively associated with exogenous nucleic acid(s) . 

42. A polypeptide receptor comprising: 
a VP16 activation domain, 

a DNA binding domain, and 

bombyx mori -derived ligand binding domain and hinge 
region. 
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43. A nucleic acid construct comprising a 
regulatory element derived from a bombyx receptor 
response element operatively associated with exogenous 
nucleic acid(s) . 

44. An animal cell in culture which is transformed 
with a nucleic acid construct according to claim 43. 

45. An animal cell according to claim 44, wherein a 
polypeptide receptor is further introduced into said 
cell, said receptor comprising: 

an activation domain, 
a DNA binding domain, 

and the ligand binding domain and hinge region of a 
non-mammalian member of the nuclear receptor 
superf amily , 

wherein said receptor peptide activates regulatory 
element in the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for said ligand 
binding domain. 

46. A transgenic nonhuman mammal, wherein cells of 
said mammal are transfected with the nucleic acid 
construct according to claim 43. 

47. A method for modulating the transcription of 
exogenous nucleic acid(s) in a host containing: 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the 
control of a regulatory element; and 

(ii) a receptor peptide comprising a DNA binding 
domain, and the ligand binding domain and hinge 
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region of a non-mammalian member of the nuclear 
receptor superfamily which is not normally 
present in the cells of said host, wherein said 
receptor peptide activates said regulatory 
element in the absence of an exogenous dimer 
partner therefor and in the presence of a 
ligand for said ligand binding domain, 

said method comprising administering to said host an 
amount of ligand effective to modulate the transcription 
of said exogenous nucleic acid(s) ; wherein ligand is not 
normally present in the cells of said host. 

48. A method according to claim 47 wherein said 
receptor peptide is provided by a nucleic acid construct 
encoding said receptor peptide. 

49. A method according to claim 48 wherein said 
receptor peptide is expressed under the control of a 
tissue specific promoter. 

50. A method according to claim 47 wherein said 
exogenous nucleic acid(s) is (are) selected from wild 
type and therapeutic nucleic acid(s) . 

51. A method according to claim 50 wherein said 
wild type nucleic acid(s) is (are) selected from genes 

^W^h^jencode f grgftocts:^ . ^ ^ ^ _ 

the substantial absence of which leads to the 

occurrence of a non-normal state in said host; or 
a substantial excess of which leads to the 

occurrence of a non-normal state in said host. 



BNSDOClD. <WO_ 99026&3A1J > 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02683 



PCT/US98/14215 



96 

52. A method according to claim 50 wherein said 
therapeutic nucleic acid(s) is (are) selected from those 
which encode products: 

which are toxic to the cells in which they are 
expressed; or 

which impart a beneficial property to said 

host. 

53 . A method according to claim 50 wherein said 
therapeutic nucleic acid(s) modulates the transcription 
and/or translation of an endogenous nucleic acid(s). 

54. A method according to claim 47 wherein said 
regulatory element is not normally present in the cells 
of said host. 

55. A method according to claim 47 wherein said DNA 
binding domain is obtained from a DNA-binding protein. 

56. A method according to claim 55 wherein said DNA 
binding domain is obtained from a member of the nuclear 
receptor superfamily. 

57. A method according to claim 55 wherein said DNA 
binding domain is a GAL4 DNA binding domain. 

58. A method according to claim 47 wherein said 

. rec^eptpx, t P.?J?Jt ide^ . 

59. A method according to claim 47 wherein said 
host further contains a non-mammalian transactivator not 
normally present in the cells of said host and a 
transactivator responsive regulatory element not normally 
present in cells of said host, 
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wherein said transactivator responsive 
regulatory element controls transcription of said 
exogenous nucleic acid(s) or a second nucleic acid 
construct comprising a second exogenous nucleic 
acid (s) . 



60. A method according to claim 59 wherein said 
exogenous nucleic acid(s) is (are) under the further 
control of said transactivator responsive regulatory 
element. 

61. A method according to claim 59 wherein said 
transactivator responsive regulatory element controls 
transcription of a second DNA construct comprising a 
second exogenous nucleic acid(s) . 

62. A method according to claim 59 wherein said 
transactivator responsive regulatory element comprises an 
operator which is responsive to said non-mammalian 
transactivator which confers responsiveness to 
antibiotics. 



63. A method according to claim 59 wherein said 
transactivator is a ligand-mediated receptor which 
confers responsiveness to antibiotics. 

64. A method according to claim 4 7 wherein said 
,h4 n 9$~J£4?i^ with amino 
acid sequence 273-362, as set forth in SEQ ID N0:3. 



65. A method according to claim 47 wherein said 
ligand binding domain has substantial sequence identity 
with amino acid sequence 3 93-508, as set forth in SEQ ID 
NO: 3. 
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66. A method according to claim 47 wherein said 
ligand binding domain has substantial sequence identity 
with amino acid sequence 393-586, as set forth in SEQ ID 
NO: 3. 

67. A method according to claim 47 wherein said 
receptor peptide has substantial sequence identity with 
the sequence set forth in SEQ ID NO:3. 

68. A method according to claim 47 wherein said 
receptor peptide has the sequence set forth in SEQ ID 
NO: 3 . 

69. A method according to claim 47 wherein said 
ligand for said ligand binding domain is a non-steroidal 
compound . 

70. A method according to claim 69 wherein said 
non-steroidal compound is a hydrazine. 

71. A method according to claim 70 wherein said 
hydrazine is a 1,2-diacyl hydrazine, an N' -substituted- 
N, N ' -disubstituted hydrazine, a dibenzoylalkyl 
cyanohydrazine, an N-substituted-N-alkyl-N,N-diaroyl 
hydrazine, an N-substituted-N-acyl-N-alkyl , carbonyl 
hydrazine, and an N-aroyl-N' -alkyl-N' -aroyl hydrazine. 

72 . A method according to claim 71 j^herein said 
hydrazine is a diacyl hydrazine. 

73. A method of inducing the transcription of an 
exogenous nucleic acid(s) in a host containing: 

(i) a nucleic acid construct comprising a promoter 
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and said exogenous nucleic acid(s) under the 
control of a regulatory element; 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in 
the cells of said host, wherein expression of 
said receptor peptide is under the control of 
an inducible promoter, wherein said receptor 
peptide activates said regulatory element in 
the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for 
said ligand binding domain, and 

(iii) said ligand for said ligand binding 
domain, wherein said ligand is not 
normally present in the cells of said 
host ; 

said method comprising subjecting said host to 
conditions suitable to induce expression of said receptor 
peptide. 

74. A method of inducing the transcription of an 
exogenous gene in a host containing a nucleic acid 
construct comprising a promoter and said exogenous 
nucleic acid(s) under the control of a regulatory 
element; said method comprising introducing into a host: 

a receptor peptide comprising a DNA binding 
domain, and the ligand binding domain and hinge 
region of a non-mammalian member of the nuclear 
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receptor superfamily which is not normally present 
in the cells of said host, wherein said receptor 
peptide activates said regulatory element in the 
absence of an exogenous dimer partner therefor and 
in the presence of a ligand for said ligand binding 
domain, and 

a ligand for said ligand binding domain. 

75. A method for the treatment of a host in need of 
gene therapy, said method comprising: 

introducing into cells of said host : 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid{s) under the 
control of a regulatory element; 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in 
the cells of said host, wherein expression of 
said receptor peptide is under the control of 
an inducible promoter, wherein said receptor 
peptide activates said regulatory element in 
the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for 

^ ^ ^ tv ^^.^ z ^ _and 3 ^ 

administering, to said host, ligand for said ligand 
binding domain. 

76 . A method for the treatment of a host in need of 
gene therapy, said method comprising: 
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introducing into cells obtained from said host; 

(i) a nucleic acid construct comprising a promoter 
and exogenous nucleic acid(s) under the control 
of a regulatory element; and 

(ii) nucleic acid endocing a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in 
the cells of said host, wherein said receptor 
peptide activates said regulatory element in 
the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for 
said ligand binding domain. 



to provide modified cells; 

reintroducing the modified cells into said host, and 
administering, to said host, ligand for said ligand 
binding domain. 

77. A method for producing transgenic animals 
capable of prolonged and regulated transcription of 
exogenous nucleic acid(s) , said method comprising 
introducing into early-stage embryos or stem cells: 



(i) a nucleic acid construct comprising a promoter 
control of a regulatory element; 



(ii) nucleic acid encoding a receptor peptide 

comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
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superfamily which is not normally present in 
the cells of said transgenic animal, wherein 
said receptor peptide activates said regulatory- 
element in the absence of an exogenous dimer 
partner therefor and in the presence of a 
ligand for said ligand binding domain. 

78 . A method for the expression of recombinant 
products detrimental to a host organism, said method 
comprising: 

transforming suitable host cells with: 

(i) a nucleic acid construct comprising a promoter 
and exogenous nucleic acid(s) which express 
said recombinant product under the control of a 
regulatory element; wherein said regulatory 
element is not normally present in the cells of 
said host, and 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the. ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in 
the cells of said host, wherein said receptor 
peptide activates said regulatory element in 
the absence of an exogenous dimer partner for 

.„^sald JLigand binding domain, and 

growing said host cells to the desired level in the 
substantial absence of ligand for said receptor peptide; 
and 

inducing expression of said recombinant product by 
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introducing into said host cells said ligand, which, in 
combination with said receptor peptide, binds to said 
regulatory element and activates transcription therefrom. 

79. A method according to claim 78, wherein said 
exogenous nucleic acid is a CAG repeat which encodes a 
polyglutamine polypeptide . 

80. A method for identifying compounds useful for 
treating neuodegenerative conditions, said method 
comprising : 

introducing a test compound into a host cell 
transformed according to claim 79, and 

identifying those test compounds that target 
intracellular aggregates of polyglutamine s as compounds 
useful for treating neuodegenerative conditions. 

81. A method for modulating the transcription of 
nucleic acid(s) in an in vitro system, said method 
comprising administering to said system an amount of said 
ligand effective to modulate the transcription of said 
nucleic acid(s); wherein said ligand is not normally 
present in said cellular system; wherein said system 
comprises : 

(i) a nucleic acid construct comprising a promoter 
and said nucleic acid(s) under the control of a 

wra:a _^ mwm ^ _ _ 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in 
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the cells of said host, wherein said receptor 
peptide activates said regulatory element in 
the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for 
said ligand binding domain. 

82. A nucleic acid construct comprising nucleic 
acid encoding a VP16 activation domain, a DNA binding 
domain and bombyx mori -derived ligand binding domain and 
hinge region. 

83 . A nucleic acid construct comprising a promoter 
and nucleic acid encoding a tetracycline-controlled 
transact ivator, a VP16 activation domain, a DNA binding 
domain and bombyx mori- derived ligand binding domain and 
hinge region, in operative association with one another. 



84. A vector comprising: 

a first reporter under the control of a regulatory 
element , 

a second reporter under the control of an operator 
which is responsive to a ligand -mediated receptor which 
confers responsiveness to antibiotics, and 

a nucleic acid construct comprising a promoter 
operatively associated with nucleic acid encoding a 
tetracycline-controlled transact ivator, a VP16 activation 
domain, a DNA binding domain and bombyx mori -derived 
i gand^b inding-^ 
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Poly - Q expression constructs 
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B3 - V polyQ - GFP constructs to date 
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SEQUENCE LISTING 
<110> The Salk Institute for Biological Studies 



<120> Modified Lepidopteran Receptors and 

Hybrid Multi- Functional Proteins for Use in Transcription 
and Transgene Expression Regulation 



<130> SALK2070WO 

<140> PCT/US98/14215 
<141> 1998-07-10 

<150> 60/091874 
<151> 1998-07-07 

<150> 08/891,298 
<151> 1997-07*10 

<160> 8 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 71 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> DNA binding domain of member of steroid/thyroid 
hormone receptor superfamily 





<400> 


1 
















Cys 


Xaa 


Xaa 


Cys 


Xaa 


Xaa Asp 


Xaa 


Ala Xaa Gly Xaa Tyr 


Xaa 


Xaa 


Xaa 


1 








5 






10 




15 




Xaa 


Cys 


Xaa 


Xaa 


Cys 


Lys Xaa 


Phe 


Phe Xaa Arg Xaa Xaa 


Xaa 


Xaa 


Xaa 








20 








25 


30 






Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa Cys 


Xaa 


Xaa Xaa Xaa Xaa Xaa 


Xaa 


Xaa 


Cys 






35 








40 


45 








Xaa 


Xaa 


Xaa 


Lys 


Xaa 


Xaa Arg 


Xaa 


Xaa Cys Xaa Xaa Cys 


Arg 


Xaa 


Xaa 




50 








55 




60 








Lys 


Cys 


Xaa 


Xaa 


Xaa 


Gly Met 












65 










70 













2 

2711 
DNA 

Bombyx mqri f _ ^ ^ ^ ^ 

<220> 
<221> CDS 

<222> (185) . . . (2002) 
<400> 2 

ctccgttgac gacggtcgcg cgtgcggtac gtccgttttt acggctcaaa cgtacacggt 60 
aacctccgtc tctgcatcat cggcggaact cgtgaaattc gcgtgctttt ctcacctgtt 120 
gaacgagttg tgttgtgact gaaaaacatc atcacaaata tcaagcttca aaactaataa 180 
gtga atg aga gtc gag aac gtg gat aac gta teg ttt get ttg aac gga 229 
Met Arg Val Glu Asn Val Asp Asn Val Ser Phe Ala Leu Asn Gly 



<210> 
<211> 
<212> 

^n3>, v 
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15 10 15 

cgc get gac gag tgg tgt atg tct gta gag acg cgt tta gat agt tta 277 
Arg Ala Asp Glu Trp Cys Met Ser Val Glu Thr Arg Leu Asp Ser Leu 
20 25 30 

gtg cga gaa aaa agt gaa gtg aaa gec tac gtc gga gga tgt ccc teg 325 
Val Arg Glu Lys Ser Glu Val Lys Ala Tyr Val Gly Gly Cys Pro Ser 
35 40 45 

gta ate acg gat get gga gcg tat gac gcg etc ttc gac atg aga cgc 373 
Val He Thr Asp Ala Gly Ala Tyr Asp Ala Leu Phe Asp Met Arg Arg 
50 55 60 

cgc tgg tct aat aac ggt ggc ttc ccg ctg cga atg ctt gaa gag age 421 
Arg Trp Ser Asn Asn Gly Gly Phe Pro Leu Arg Met Leu Glu Glu Ser 
65 70 75 

tct tea gaa gtg aca teg tct teg gca ctg ggt ttg cca ccg gee atg 469 
Ser Ser Glu Val Thr Ser Ser Ser Ala Leu Gly Leu Pro Pro Ala Met 
80 65 90 95 

gtt atg teg ccg gaa tec ttg gcg teg ccc gag tat cga gec etc gag 517 
Val Met Ser Pro Glu Ser Leu Ala Ser Pro Glu Tyr Arg Ala Leu Glu 
100 105 110 

eta tgg age tac gat gac gga ate act tat aat aca gec cag tct ctg 565 
Leu Trp Ser Tyr Asp Asp Gly He Thr Tyr Asn Thr Ala Gin Ser Leu 
115 120 125 

ctg ggt gca tgc aat atg caa cag caa cag eta caa cct cag caa cca 613 
Leu Gly Ala Cys Asn Met Gin Gin Gin Gin Leu Gin Pro Gin Gin Pro 
130 135 140 

cat cca gca cca ccg acg etc ccc acg atg cct tta cca atg cct ccc 661 
His Pro Ala Pro Pro Thr Leu Pro Thr Met Pro Leu Pro Met Pro Pro 
145 150 155 

aca aca ccg aaa tea gaa aat gaa teg atg tea tea ggt cga gag gaa 709 
Thr Thr Pro Lys Ser Glu Asn Glu Ser Met Ser Ser Gly Arg Glu Glu 
160 165 170 175 

ctt teg ccg get tea age ata aat ggc tgc agt get gat get gac gec 757 
Leu Ser Pro Ala Ser Ser He Asn Gly Cys Ser Ala Asp Ala Asp Ala 
180 185 190 

aga egg cag aag aaa ggt cct gca cct cga cag caa gag gag eta tgt 805 
Arg Arg Gin Lys Lys Gly Pro Ala Pro Arg Gin Gin Glu Glu Leu Cys 
195 200 205 

Leu Val Cys Gly Asp Arg Ala Ser Gly Tyr His Tyr Asn Aia Leu" Thr 
210 215 220 

tgt gaa gga tgc aaa gga ttc ttc agg egg agt gtc acc aaa aac gca 901 
Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Ser Val Thr Lys Asn Ala 
225 230 235 

gta tat att tgt aaa ttt gga cat gec tgt gaa atg gat atg tac atg 949 
Val Tyr He Cys Lys Phe Gly His Ala Cys Glu Met Asp Met Tyr Met 
240 245 250 255 
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agg agg aaa tgt caa gag tgt cga tta aag aaa tgt eta gcg gta gga 
Arg Arg Lys Cys Gin Glu Cys Arg Leu Lys Lys Cys Leu Ala Val Gly 
260 265 270 



997 



atg agg cct gaa tgt gtc ata cag gag ccc agt aaa aat aaa gac agg 
Met Arg Pro Glu Cys Val He Gin Glu Pro Ser Lys Asn Lys Asp Arg 
275 280 285 



1045 



caa aga caa aag aaa gac aaa gga ata tta tta cct gtt agt acg acc 
Gin Arg Gin Lys Lys Asp Lys Gly He Leu Leu Pro Val Ser Thr Thr 
290 295 300 



1093 



aca gtc gaa gac cac atg ccc ccg ate atg caa tgt gat cca cct ccg 
Thr Val Glu Asp His Met Pro Pro He Met Gin Cys Asp Pro Pro Pro 
305 310 315 



1141 



ccc gag gec gec agg att cac gaa gtc gtc ccg agg tat ctt teg gag 
Pro Glu Ala Ala Arg He His Glu Val Val Pro Arg Tyr Leu Ser Glu 
320 325 330 335 



1189 



aag ctg atg gag cag aac agg cag aag aac ata cca cca ttg teg gcg 
Lys Leu Met Glu Gin Asn Arg Gin Lys Asn He Pro Pro Leu Ser Ala 
340 345 350 



1237 



aat cag aag tct ctg ate gcg agg etc gtg tgg tac cag gag gga tat 
Asn Gin Lys Ser Leu He Ala Arg Leu Val Trp Tyr Gin Glu Gly Tyr 
355 360 365 



1285 



gag cag ccc tec gac gag gat etc aaa aga gta acg cag act tgg cag 
Glu Gin Pro Ser Asp Glu Asp Leu Lys Arg Val Thr Gin Thr Trp Gin 
370 375 380 



1333 



teg gat gaa gag gac gag gaa tec gat eta ccc ttc cgc cag ate acg 
Ser Asp Glu Glu Asp Glu Glu Ser Asp Leu Pro Phe Arg Gin He Thr 
385 390 395 



1381 



gag atg acg ate tta acg gtc cag ttg ate gtc gag ttc gec aag ggt 
Glu Met Thr He Leu Thr Val Gin Leu lie Val Glu Phe Ala Lys Gly 
400 405 410 415 



1429 



eta ccg ggc ttt teg aag ata tea cag tct gat caa ate acc tta tta 
Leu Pro Gly Phe Ser Lys He Ser Gin Ser Asp Gin He Thr Leu Leu 
420 425 430 



1477 



aaa gec teg tec age gag gtg atg atg ctg egg gtg gcg agg cga tac 
Lys Ala Ser Ser Ser Glu Val Met Met Leu Arg Val Ala Arg Arg Tyr 
435 440 445 



1525 



gac gec gcg tec gac age gtg ctg ttc gec aac aac aag gcg tac acg 
Asp Ala Ala Ser Asp Ser Val Leu Phe Ala Asn Asn Lys Ala Tyr Thr 



1573 



cgc gac aac tac cgc caa ggc ggc atg gec tac gtc ate gaa gac etc 
Arg Asp Asn Tyr Arg Gin Gly Gly Met Ala Tyr Val He Glu Asp Leu 
465 470 475 



1621 



eta cac ttc tgc egg tgc atg ttc gcg atg ggc atg gac aat gtg cac 
Leu Hie Phe Cys Arg Cys Met Phe Ala Met Gly Met Asp Asn Val His 
480 485 490 495 



1669 



ttt gca ctg etc acg gec ate gtt ata ttc tea gat egg ccc ggg etc 
Phe Ala Leu Leu Thr Ala He Val He Phe Ser Asp Arg Pro Gly Leu 



1717 
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500 



505 



510 



gag cag ccg teg ctg gta gaa gag ate cag aga tac tac ctg aac acg 
Glu Gin Pro Ser Leu Val Glu Glu lie Gin Arg Tyr Tyr Leu Asn Thr 
515 520 525 



1765 



ttg cga att tac ate ate aac cag aac age gcg teg teg cgc tgc gec 
Leu Arg lie Tyr He He Asn Gin Asn Ser Ala Ser Ser Arg Cys Ala 
530 535 540 



1B13 



gtg ate tac ggc agg ate ctg age gtg ctg acc gag eta cgc acg etc 
Val He Tyr Gly Arg He Leu Ser Val Leu Thr Glu Leu Arg Thr Leu 
545 550 555 



1861 



ggc acg caa aac tec aac atg tgc ate teg ctg aag ctg aag aac agg 
Gly Thr Gin Asn Ser Asn Met Cys He Ser Leu Lys Leu Lys Asn Arg 
560 565 570 575 



1909 



aag ctg ccg ccg ttc etc gag gag ate tgg gac gtg gcg gag gtg gec 
Lys Leu Pro Pro Phe Leu Glu Glu He Trp Asp Val Ala Glu Val Ala 
580 585 590 



1957 



acg acg cat ccc acg gtg ctg ccg ccc acc aac ccg gtg gtg eta 
Thr Thr His Pro Thr Val Leu Pro Pro Thr Asn Pro Val Val Leu 
595 600 605 



2002 



tagcctccgc 
ctgatccgta 
atatatagtg 
getgegtegg 
attgaggece 
aaactgaagt 
gacttctcat 
tatatgaaga 
tttaccaaat 
atcgcagtgt 
gttttggaat 
ttttaatttc 



ccgccccagg 
ttcggtgaca 
ttgaeggega 
tcgcgccctt 
caagaacegg 
aattaatata 
ggtctcaata 
gatgtttcta 
taatgaagcg 
cggccgcgtg 
atgtttgttt 
gattcgegtt 



agagaacget 
gattagtgat 
ggcccgtccg 
gcgaccacga 
aagectaact 
ttaacagtta 
aggeggaegt 
ttggatggaa 
tcgggtgtag 
ctctcaccaa 
atatatagtt 
cattcgtcct 



catagactgg 
tatatgtgtt 
gccccgtact 
taagactact 
aagatacgac 
acacaggttc 
ataagagtta 
tgcgtgatgt 
accttttgta 
aagcgtgcgg 
tatgtgtgag 
gtgetegcta 



ctagttttag 
gttgaacgtt 
tgtttcgttt 
ttctataagt 
ggtttatttt 
aaagcagtta 
catagagaat 
aaagttgata 
tgtgatgtgg 
tegaegctaa 
gtgttatcgt 
ctcagattt 



tgaacgtgcg 
tggagagtat 
ctgaceggat 
aegtctctaa 
atcacagagg 
ggccgctccg 
aaaataataa 
gttattttta 
cgaggagtgg 
tagtgcgatg 
gtcccgtcaa 



2062 
2122 
2182 
2242 
2302 
2362 
2422 
2482 
2542 
2602 
2662 
2711 



<210> 
<211> 
<212> 
<213> 



3 

606 
PRT 

Bombyx roori 



<400> 
Met Arg Val 

1 

Ala Asp Glu 
Arg Glu Lys 

•r.-.v'**.- ■>• «--.'35 r ' ,r -' 

He Thr Asp 
50 

Trp Ser Asn 
65 

Ser Glu Val 

Met Ser Pro 

Trp Ser Tyr 
115 

Gly Ala Cys 



Glu Asn Val 
5 

Trp Cys Met 
20 

Ser Glu Val 

Ala Gly Ala 

Asn Gly Gly 
70 

Thr Ser Ser 
65 

Glu Ser Leu 
100 

Asp Asp Gly 
Asn Met Gin 



Asp Asn 

Ser Val 

Lys Ala 

Tyr Asp 
55 

Phe Pro 

Ser Ala 

Ala Ser 

He Thr 
120 
Gin Gin 



val Ser Phe Ala 
10 

Glu Thr Arg Leu 
25 

Tyr Val Gly Gly 

Ala Leu Phe Asp 
60 

Leu Arg Met Leu 
75 

Leu Gly Leu Pro 
90 

Pro Glu Tyr Arg 
105 

Tyr Asn Thr Ala 
Gin Leu Gin Pro 



Leu Asn 

Asp Ser 

30 
Cys Pro 

.45.:, .7^.- 

Met Arg 

Glu Glu 

Pro Ala 

Ala Leu 
110 
Gin Ser 
12S 

Gin Gin 



Gly Arg 
15 

Leu Val 

Ser Val 

Arg Arg 

Ser Ser 

80 
Met Val 
95 

Glu Leu 
Leu Leu 
Pro His 
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130 135 




14 0 










Pro 


&1 a Pro Prn Thr* Tah Dy-n TViy* Ma^ 


rro 


Leu Pro 


Met 


rro 


Pro 


inr 


145 


150 




155 








1DU 


Thr 


Ptti T.vtq ^pt* Rlii ZVqyi nin Cer M»t* 
t J>w Ajy o ^ci vi u itsji viu set net- 




Ser Gly 


Arg 




fit i« 

ulu 


Leu 




165 


170 








J. /D 




Ser 


t *-\J r\-L. a. act OCX. XXC noil vJ-y uys 


act 


Ala Asp 


ax a 


Asp 


Ala 


Arg 




180 185 








190 






Arg 


win ii/b -wy*» uiy rto nia rro Arg 


bin 




LrJLU 


Leu 


vys 


Leu 




195 200 






one 








Val 


*-y» «iy nop Arg /i± ot aer vJ-y iyr 


rilS 


Tyr Asn 


TV 1 _ 

Ala 


Leu 


Thr 


Cys 




210 215 














Glu 


uAy *-yt* i->y y wiy rne rne Arg Arg 


Ser 


vaj. inr 


Lys 


Asn 


Ala 


Val 


225 


230 




O 1 c 








24 0 


Tyr 


lie v,yo Jjyo rile uiy nlS nia v.ys 


ni it 


Met Asp 


Met 


Tyr 


Met 


Arg 




245 


250 












Arg 


T.vs C\rR (31 n f!l \i ("V/q Ave* T.^n f.vc 
xjy o vyg vj-i. ii uiu ^ya mvj ucu uys 


Lys 


Cys Leu 


Ala 


vai 


Gly 


i*iet 




260 

« w V 3 














Arg 


p>-o Gin rve VaI ti p nin nln Pv^ 

rtu v.yt» vctx lie uin uiu rio 


Oct 


Lys Asn 


Lys 


Asp 


Arg 


m n 
uin 




275 280 






285 








Arg 


f!l n T .\r T ,v/ o 7\ o f\ T .vf C f3"lvr T "1 *» T.aii 
vjih uy o jjy s no£j uya uiy lie jjcu 


Leu 


rro vai 


oer 


i nr 


Thr 


l nr 




290 295 




300 










Val 


r\Ofcf CTA£> lie l> A i. r£LS XXC 1 (C L. 


Gin 


Cys Asp 


Pro 


riu 


Pro 


rro 


305 


310 




3 15 








320 


Glu 


Ala Ala Arg lie His Glu Val Val 


Pro 


Arg Tyr 


Leu 


Ser 


Glu 


Lys 




325 


330 








335 




Leu 


Met Glu Gin Asn Arg Gin Lys Asn 


lie 


Pro Pro 


Leu 


Ser 


Ala 


As 




340 345 








350 






Gin 


Lys Ser Leu He Ala Arg Leu Val 




Tyr Gin 
yr n 


Glu 


Gly 


Tyr 


Glu 




355 360 






365 








Gin 


Pro Ser Asp Glu Asp Leu Lys Arg 


Val 


Thr Gin 


Thr 


Trn 
irp 


Gin 


Ser 




370 375 




380 










Asp 


Glu Glu Asp Glu Glu Ser Asp Leu 


Pro 


Phe Arg 


Gin 


He 


Thr 


Glu 


385 


390 




395 








400 


Met 


Thr He Leu Thr Val Gin Leu He 


Val 


Glu Phe 


Ala 


- 

ys 


Glv 


Leu 




405 


410 








415 




Pro 


Gly Phe Ser Lys He Ser Gin Ser 


ASp 


Gin He 


Thr 


Leu 


Leu 


ys 




420 425 








430 






Ala 


Ser Ser Ser Glu Val Met Met Leu 


Arg 


Val Ala 


Arg 




Tvr 


Asp 




435 440 






445 








Ala 


Ala Ser Asp Ser Val Leu Phe Ala 


Asn 


Asn Lys 


Ala 


Tvr 
j.y t 


Thr 






450 455 




460 










Asp 


Asn Tyr Arg Gin Gly Gly Met Ala 


Tyr 


Val He 


Glu 


Asp 


Leu 


Leu 


465 


.470 




475 








480 


His 


Phe Cys Arg Cys Met Phe Ala Met 


Gly 


Met Asp 


Asn 


Val 


His 


Phe 




485 


490 








495 




Ala 


Leu Leu Thr Ala lie Val lie Phe 


Ser 


Asp Arg 


Pro 


Gly 


Leu 


Glu 




500 505 








510 






Gin 


Pro Ser Leu Val Glu Glu He Gin 


Arg 


Tyr Tyr 


Leu 


Asn 


Thr 


Leu 




515 520 






525 








Arg 


He Tyr He He Asn Gin Asn Ser 


Ala 


Ser Ser 


Arg 


Cys 


Ala 


Val 














lie Tyr Gly Arg lie Leu Ser Val Leu 


Thr 


Glu Leu 


Arg 


Thr 


Leu 


Gly 


545 


550 




555 








560 


Thr Gin Asn Ser Asn Met Cys He Ser 


Leu 


Lys Leu 


Lys 


Asn 


Arg 


Lys 




565 


570 








575 




Leu 


Pro Pro Phe Leu Glu Glu He Trp Asp Val Ala Glu 


Val 


Ala 


Thr 




580 585 








590 






Thr 


His Pro Thr Val Leu Pro Pro Thr 


Asn 


Pro Val 


Val 


Leu 








595 600 






605 









<210> 4 
<211> 17 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> GAM response element 



<4G0> 4 
cggaggactg tcctccg 

<210> 5 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<400> 5 

gagagaagct tatggcgcgc ccgaccgatg 

<21Q> 6 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<400> 6 

cacacacgcg tgtactcgtc aattccaag 

<210> 7 
<211> 1577 
<212> DNA 

<213> Artificial Sequence 
<400> 7 

atggcgcgcc cgaccgatgt cagcctgggg 

gcgatggcgc atgccgacgc gctagacgat 

tccccgggtc cgggatttac cccccacgac 

gacttcgagt ttgagcagat gtttaccgat 

gatagtttag tgcgagaaaa aagtgaagtg 

atcacggatg ctggagcgta tgacgcgctc 

ggtggcttcc cgctgcgaat gcttgaagag 

ctgggtttgc caccggccat ggttatgtcg 

gccctcgagc tatggagcta cgatgacgga 

ggtgcatgca atatgcaaca gcaacagcta 

acgctcccca cgatgccttt accaatgcct 

atgtcatcag gtcgagagga actttcgccg 

gctgacgcca gacggcagaa gaaaggtcct 

gtctgcggcg acagagcctc cggataccac 

ggattcttca ggcggagtgt caccaaaaac 

tgtgaaatgg atatgtacat gaggaggaaa 

gcggtaggaa tgaggcctga atgtgtcata 

agacaaaaga aagacaaagg aatattatta 

atgcccccga tcatgcaatg tgatccacct 
• glrcccg aggt^a t-ct t tegga- gaag cfega fcg- 

ttgtcggcga atcagaagtc tctgatcgcg 

cagccctccg acgaggatct caaaagagta 

gaggaatccg atctaccctt ccgccagatc 

atcgtcgagt tcgccaaggg tctaccgggc 

accttattaa aagcctcgtc cagcgaggtg 

gccgcgtccg acagcgtgct gttcgccaac 

caaggcggca tggcctacgt catcgaagac 

atgggcatgg acaatgtgca ctttgcactg 

cccgggctcg agcagccgtc gctggtagaa 

cgaatttaca tcatcaacca gaacagcgcg 

atcctgagcg tgctgaccga gctacgcacg 



17 



gacgagctcc acttagacgg 
ttcgatctgg acatgttggg 
tccgccccct acggcgctct 
gcccttggaa ttgacgagta 
aaagcctacg tcggaggatg 
ttcgacatga gacgccgctg 
agctcttcag aagtgacatc 
ccggaatcct tggcgtcgcc 
atcacttata atacagccca 
caacctcagc aaccacatcc 
cccacaacac cgaaatcaga 
gcttcaagca taaatggctg 
gcacctcgac agcaagagga 
tacaacgcac tgacgtgtga 
gcagtatata tttgtaaatt 
tgtcaagagt gtcgattaaa 
caggagccca gtaaaaataa 
cctgttagta cgaccacagt 
ccgcccgagg ccgccaggat 
gagcagaaca^ggcagaagaa, 
aggctcgtgt ggtaccagga 
acgcagactt ggcagtcgga 
acggagatga cgatcttaac 
ttttcgaaga tatcacagtc 
atgatgctgc gggtggcgag 
aacaaggcgt acacgcgcga 
ctcctacact tctgccggtg 
ctcacggcca tcgttatatt 
gagatccaga gatactacct 
tcgtcgcgct gcgccgtgat 
ctcggcacgc aaaactccaa 



cgaggacgtg 

ggacggggat 

ggatatggcc 

cacgcgttta 

tccctcggta 

gtctaataac 

gtcttcggca 

cgagtatcga 

gtctctgctg 

agcaccaccg 

aaatgaatcg 

cagtgctgat 

gctatgtctt 

aggatgcaaa 

tggacatgcc 

gaaatgtcta 

agacaggcaa 

cgaagaccac 

tcacgaagtc 

^catacpa.cpa.. 

gggatatgag 

tgaagaggac 

ggtccagttg 

tgatcaaatc 

gcgatacgac 

caactaccgc 

catgttcgcg 

ctcagatcgg 

gaacacgttg 

ctacggcagg 

catgtgcatc 



30 



29 



€0 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 

1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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tcgctgaagc tgaagaacag gaagctgccg ccgttcctcg aggagatctg ggacgtggcg 1920 

gaggtggcca cgacgcatcc cacggtgctg ccgcccacca acccggtggt gctatag 1977 

<210> B 
<211> 9072 
<212> DNA 

<213> Artificial Sequence 
<400> 8 

tatggaggga tacccgtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat 60 

ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc 120 

aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct 180 

tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag 240 

atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta 300 

agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc 360 

tgctatgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca 420 

tacactattc tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg 480 

atggcatgac agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg 540 

ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca 600 

tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa 660 

acgacgagcg tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa 720 

ctggcgaact acttactcta gctccccggc aacaattaat agactggatg gaggcggata 780 

aagttgcagg accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat 840 

ctggagccgg tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc 900 

cctcccgtat cgtagttatc tacacgacgg ggagtcaggc aactacggat gaacgaaata 960 

gacagatcgc tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt 1020 

actcatatat actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga 1080 

agatcctttt tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag 1140 

cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa 1200 

tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag 1260 

agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg 1320 

tccttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat 1380 

acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta 1440 

ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg 1500 

gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc 1560 

gtgagctatg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa 1620 

gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc 1680 

tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt 1740 

caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct 1800 

tttgctggcc ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc 1860 

gtattaccgc ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg 1920 

agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt 1980 

ggccgattca ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc 2040 

gcaacgcaat taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc 2100 

ttccggctcg tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct 2160 

atgaccatga ttacgccaag cgcgcaatta accctcacta aagggaacaa aagctggagc 2220 

tagcggccgc aataaaataa aagattttat ttagtctcca gaaaaagggg ggaatgaaag 2280 

accccacctg taggtttggc aagctagctt aagtaacgcc attttgcaag gcatggaaaa 2340 

atacataact gagaatagag aagttcagat caaggtcagg aacagatgga acagctgaat 2400 

atgggccaaa caggatatct gtggtaagca gttcctgccc cggctcaggg ccaagaacag 2460 
M * ^ — ^^v^g^^^g^ -egaat ^ g gg , ccaaarcagga-t atetgtggt -aagcagt tec :tgccceggct v^w25 

cagggecaag aacagatggt ccccagatgc ggtccagccc tcagcagttt etagaacaag 2580 

ggttcaatgc acttgtcget agaacaaggg ttcaatgeae ttgtcgctag aacaagggtt 2640 

caatgeaett gtcgctagaa caagggttca atgcacttgt cgagaaccat cagatgtttc 2700 

cagggtgccc caaggacctg aaatgaccct gtgecttatt tgaactaacc aatcagttcg 2760 

cttctcgctt ctgttcgcgc gcttctgctc cccgagctca ataaaagagc ccacaacccc 2820 

teactegggg cgccagtcct ccgattgact gagacgegtc ggggtacccg tattcccaat 2880 

aaagectett gctgttgcat ccgaatcgtg gaetegctga tccttgggag ggtctcctca 2940 

gattgattga ctgcccacct egggggtett tcatttggag gttccaccga gattggagac 3000 

ccctgcc tag ggaccaccga ccccccccgc egggaggtaa gctggccagc ggtcggtttc 3060 

gtgtctgtct ctgtctttgt gcgtgtttgt gccggcatct aaCgtttgcg cctgcgtctg 3120 

tactagttag ctaactagct ctgtatctgg cggacccgtg gtggaactga cgagttctga 3180 
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acacccggcc gcaaccctgg gagacgtccc agggactttg ggggccgttt ttgtggcccg 3240 

acctgaggaa gggagtcgat gtggaatccg accccgtcag gatatgtggt tctggtagga 3300 

gacgagaacc taaaacagtt cccgcctccg tctgaatttt tgctttcggt ttggaaccga 3360 

agccgcgcgt cttgtctgct gcagcatcgt tctgtgttgt ctctgtctga ctgtgtttct 3420 

gtatttgtct gaaaattagg gccagactgt taccactccc ttaagtttga ccttaggtca 3480 

ctggaaagat gtcgagcgga tcgctcacaa ccagtcggta gatgtcaaga agagacgttg 3540 

ggttaccttc tgctctgcag aatggccaac ctttaacgtc ggatggccgc gagacggcac 3600 

ctttaaccga gacctcatca cccaggttaa gatcaaggtc ttttcacctg gcccgcatgg 3660 

acacccagac caggtcccct acatcgtgac ctgggaagcc ttggcttttg acccccctcc 3720 

ctgggtcaag ccctttgtac accctaagcc tccgcctcct cttcctccat ccgccccgtc 3780 

tctccccctt gaacctcctc gttcgacccc gcctcgatcc tccctttatc cagccctcac 3840 

tccttctcta ggcgccggaa ttcggcgcgt gtggcctcga acaccgagcg accctgcagc 3900 

gacccgctta acagcgtaag cttgtcgacg gtatcgataa gcttatggcg cgcccgaccg 3960 

atgtcagcct gggggacgag ctccacttag acggcgagga cgtggcgatg gcgcatgccg 4020 

acgcgctaga cgatttcgat ctggacatgt tgggggacgg ggattccccg ggtccgggat 4080 

ttacccccca cgactccgcc ccctacggcg ctctggatat ggccgacttc gagtttgagc 4140 

agatgtttac cgatgccctt ggaattgacg agtacacgcg tttagatagt ttagtgcgag 42Q0 

aaaaaagtga agtgaaagcc tacgtcggag gatgtccctc ggtaatcacg gatgctggag 4260 

cgtatgacgc gctcttcgac atgagacgcc gctggtctaa taacggtggc ttcccgctgc 4320 

gaatgcttga agagagctct tcagaagtga catcgtcttc ggcactgggt ttgccaccgg 4380 

ccatggttat gtcgccggaa tccttggcgt cgcccgagta tcgagccctc gagctatgga 4440 

gctacgatga cggaatcact tataatacag cccagtctct gctgggtgca tgcaatatgc 4500 

aacagcaaca gctacaacct cagcaaccac atccagcacc accgacgctc cccacgatgc 4560 

ctttaccaat gcctcccaca acaccgaaat cagaaaatga atcgatgtca tcaggtcgag 4620 

aggaactttc gccggcttca agcataaatg gctgcagtgc tgatgctgac gccagacggc 4680 

agaagaaagg tcctgcacct cgacagcaag aggagctatg tcttgtctgc ggcgacagag 4740 

cctccggata ccactacaac gcactgacgt gtgaaggatg caaaggattc ttcaggcgga 4800 

gtgtcaccaa aaacgcagta tatatttgta aatttggaca tgcctgtgaa atggatatgt 4860 

acatgaggag gaaatgtcaa gagtgtcgat taaagaaatg tctagcggta ggaatgaggc 4920 

ctgaatgtgt catacaggag cccagtaaaa ataaagacag gcaaagacaa aagaaagaca 4980 

aaggaatatt attacctgtt agtacgacca cagtcgaaga ccacatgccc ccgatcatgc 5040 

aatgtgatcc acctccgccc gaggccgcca ggattcacga agtcgtcccg aggtatcttt 5100 

cggagaagct gatggagcag aacaggcaga agaacatacc accattgtcg gcgaatcaga 5160 

agtctctgat cgcgaggctc gtgtggtacc aggagggata tgagcagccc tccgacgagg 5220 

atctcaaaag agtaacgcag acttggcagt cggatgaaga ggacgaggaa tccgatctac 5280 

ccttccgcca gatcacggag atgacgatct taacggtcca gttgatcgtc gagttcgcca 5340 

agggtctacc gggcttttcg aagatatcac agtctgatca aatcacctta ttaaaagcct 5400 

cgtccagcga ggtgatgatg ctgcgggtgg cgaggcgata cgacgccgcg tccgacagcg 5460 

tgctgttcgc caacaacaag gcgtacacgc gcgacaacta ccgccaaggc ggcatggcct 5520 

acgtcatcga agacctccta cacttctgcc ggtgcatgtt cgcgatgggc atggacaatg 5580 

tgcactttgc actgctcacg gccatcgtta tattctcaga tcggcccggg ctcgagcagc 5640 

cgtcgctggt agaagagatc cagagatact acctgaacac gttgcgaatt tacatcatca 5700 

accagaacag cgcgtcgtcg cgctgcgccg tgatctacgg caggatcctg agcgtgctga 5760 

ccgagctacg cacgctcggc acgcaaaact ccaacatgtg catctcgctg aagctgaaga 5820 

acaggaagct gccgccgttc ctcgaggaga tctgggacgt ggcggaggtg gccacgacgc 5880 

atcccacggt gctgccgccc accaacccgg tggtgctata gcctccgccc gccccaggag 594 0 

agaacgctca tagactggct agttttagtg aacgtgcgct gatccgtatt cggtgacaga 6000 

ttagtgatta tatgtgttgt tgaacgtttg gagagtatat atatagtgtt atcgaattcc 6060 

gcccctctcc ctcccccccc cctaacgtta ctggccgaag ccgcttggaa taaggccggt 6120 

gtgcgtttgt ctatatgtta ttttccacca tattgccgtc ttttggcaat gtgagggccc 6180 

gaatgcaagg tctgttgaat gtcgtgaagg aagcagttcc tctggaagct tcttgaagac 6300 

aaacaacgtc tgtagcgacc ctttgcaggc agcggaaccc cccacctggc gacaggtgcc 6360 

tctgcggcca aaagccacgt gtataagata cacctgcaaa ggcggcacaa ccccagtgcc 6420 

acgttgtgag ttggatagtt gtggaaagag tcaaatggct ctcctcaagc gtattcaaca 6480 

aggggctgaa ggatgcccag aaggtacccc attgtatggg atctgatctg gggcctcggt 654 0 

gcacatgctt tacgtgtgtt tagtcgaggt taaaaaacgt ctaggccccc cgaaccacgg 6600 

ggacgtggtt ttcctttgaa aaacacgatg ataatatggc cacaaccatg gcatgattga 6660 

acaagatgga ttgcacgcag gttctccggc cgcttgggtg gagaggctat tcggctatga 6720 

ctgggcacaa cagacaatcg gctgctctga tgccgccgtg ttccggctgt cagcgcaggg 6780 

gcgcccggtt ctttttgtca agaccgacct gtccggtgcc ctgaatgaac tgcaggacga 6840 

ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct tgcgcagctg tgctcgacgt 6900 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02683 PCT/US98/14215 



tgtcactgaa gcgggaaggg actggctgct attgggcgaa gtgccggggc aggatctcct 6960 

gtcatctcac cttgctcctg ccgagaaagt atccatcatg gctgatgcaa tgcggcggct 7020 

gcatacgctt gatccggcta cctgcccatt cgaccaccaa gcgaaacatc gcatcgagcg 7080 

agcacgtact cggatggaag ccggtcttgt cgatcaggat gatctggacg aagagcatca 7140 

ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg cgcatgcccg acggcgagga 7200 

tctcgtcgtg acccatggcg atgcctgctt gccgaafcatc atggtggaaa atggccgctt 7260 

ttctggattc atcgactgtg gccggctggg tgtggcggac cgctatcagg acatagcgtt 7320 

ggctacccgt gatattgctg aagagcttgg cggcgaatgg gctgaccgct tcctcgtgct 73 BO 

ttacggtatc gccgctcccg attcgcagcg catcgccttc tatcgccttc ttgacgagtt 7440 

cttctgagcg ggactctggg gttcgaacct gtaggtttgg caagctagaa caagggttca 7500 

atgcacttgt cgctagaaca agggttcaat gcacttgtcg ctagaacaag ggtecaatgc 7560 

acttgtcgct agaacaaggg ttcaatgcac ttgtcgagaa ccatgggtcg agtaggcgtg 7620 

tacggtggga ggcctatata agcagagctc gtttagtgaa ccgtcagatc gcctggagac 7680 

gccatccaac gctgttttgc ctccaaaaga caccgggacc gaggccgcct gggcccgttt 7740 

aaacatcgat aaaataaaag attttattta gtctccagaa aaagggggga atgaaagacc 7800 

ccacctgtag gtttggcaag ctagcttaag taacgccatt ttgcaaggca tggaaaaata 7860 

cataactgag aatagagaag ttcagatcaa ggtcaggaac agatggaaca gctgaatatg 7920 

ggccaaacag gatatctgtg gtaagcagtt cctgccccgg ctcagggcca agaacagatg 7980 

gaacagctga atatgggcca aacaggatat ctgtggtaag cagttcctgc cccggctcag 8040 

ggccaagaac agatggtccc cagatgcggt ccagccctca gcagtttcta gaacaagggt 8100 

tcaatgcact tgtcgctaga acaagggttc aatgcacttg tcgctagaac aagggttcaa 8160 

tgcacttgtc gctagaacaa gggttcaatg cacttgtcga gaaccatcag atgtttccag 8220 

ggtgccccaa ggacctgaaa tgaccctgtg ccttatttga actaaccaat cagttcgctt 8280 

ctcgcttctg ttcgcgcgct tctgctcccc gagctcaata aaagagccca caacccctca 8340 

ctcggggcgc cagtcctccg attgactgag tcgcccgggt acccgtattc ccaataaacc 8400 

tcttgctgat gcatggtacc caattcgccc tatagtgagt cgtattacgc gcgctcactg 8460 

gccgtcgttc tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt 8520 

gcagcacatc cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct 8580 

tcccaacagt tgcgcagcct gaatggcgaa tggaaattgt aagcgttaat attttgttaa 8640 

aattcgcgtt aaatttttgt taaatcagct cattttttaa ccaataggcc gaaatcggca B700 

aaatccctta taaatcaaaa gaatagaccg agatagggtt gagtgttgtt ccagtttgga 8760 

acaagagtcc actattaaag aacgtggact ccaacgtcaa agggcgaaaa accgtctatc 8820 

agggcgatgg cccactacgt gaaccatcac cctaatcaag ttttttgggg tcgaggtgcc 8880 

gtaaagcact aaatcggaac cctaaaggga gcccccgatt tagagcttga cggggaaagc 8940 

cggcgaacgt ggcgagaaag gaagggaaga aagcgaaagg agcgggcgct agggcgctgg 9000 

caagtgtagc ggtcacgctg cgcgtaacca ccacacccgc cgcgcttaat gcgccgctac 9060 

agggcgcgtc ag 9072 
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